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As  the  feature  sizes  ol  Yerv-Earge- Scale- Integra  ted  (VLSI)  circuits  continue  to  decrease, 
the  timing  periormance  ol  a  design  cannot  be  estimated  accurately  without  introducing  the  sig¬ 
nal  delay  due  to  interconnect  parasitics.  Modeling  interconnect  parasilics  directly  from  a  cir¬ 
cuit  layout  is  therefore  emphasized. 

In  this  research,  two  programs.  EEMRC  and  HPEY.  have  been  developed  to  investigate  the 
following  areas:  (1)  interconnect  modeling.  (2)  hierarchical  parasitic  circuit  extraction,  and  (3) 
collapsing  technique  for  interconnects.  The  EEMRC  is  a  two-dimensional,  finite-element  pro¬ 
gram  which  computes  the  resistance  or  the  capacitance  from  the  user-spec  died  geometry.  Since 
the  equation  formulation  for  EEMRC'  is  based  on  a  finite-element  method,  there  is  no  shape  res¬ 
trictions  on  dielectric  interlaces  or  conductor  geometries  In  resistance  calculation,  a  quasi- 
three-dimensional  elfecl  ol  contact  resistance  is  also  taken  into  account.  The  program  HPEX  is 
a  hierarchical  parasitic  circuit  extractor  which  takes  the  C1E  layout  description  as  an  input  and 
generates  a  SP1U1  input  with  different  details  of  interconnect  parasitics.  In  this  extractor, 
analytical  formulas  fitted  from  numerical  data  are  used  to  model  interconnect  parasitics  ol 
AT  SI  circuits  in  order  io  compromise  between  the  accuracy  and  the  computation  lime  Simula¬ 
tions  show  that  by  carefully  fitting  data  analytical  formulas  can  be  verv  accurate,  especially 
when  the  interconnect  region  is  fairly  regular.  I.ayout  partition  technique,  which  facilitates  the 
interconnect  modeling,  is  also  studied  and  implemented  in  HPEX.  I  inallv.  a  new  node  reiluc 
tion  technique  which  accurately  reduces  parasitic  RC  networks  is  studied.  Tins  technique  is 
capable  ol  reducing  a  large  volume  of  interconnect  data  into  a  manageable  size,  thereby  sub 
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INTRODUCTION 


As  the  density  and  complexity  of  Very-Large-Scale-Iniegrated  (VLSI)  circuits  continue  to 
increase,  detecting  errors  in  circuit  and  layout  designs  manually  becomes  almost  impossible. 
Hence,  to  ensure  error-free  circuit  designs  before  being  sent  to  the  expensive  fabrication  step,  it 
is  important  to  have  Computer-Aided-Design  (CAD)  tools  to  perform  design  verifications  rang¬ 
ing  from  the  behavior  level  to  the  circuit  level.  In  general,  extensive  verification  steps  should 
be  taken  on  the  different  levels  of  circuit  designs  to  make  sure  that  the  functionality  and  per¬ 
formance  of  a  VLSI  system  meet  the  user's  specifications.  A  flowchart  describing  different  lev¬ 
els  of  designs  and  their  verifications  is  shown  in  Figure  1  1. 

In  order  to  handle  the  complexity  problem  inherent  in  VLSI  designs,  it  is  a  common  prac¬ 
tice  to  design  a  VLSI  system  hierarchically.  Once  a  VLSI  system  is  constructed  in  this  fashion, 
it  is  natural  to  exploit  the  design  hierarchy  in  verification  tools  from  the  computer  lime  point 
of  view.  Figure  1.1  also  shows  several  levels  of  the  hierarchy  in  the  top-down  design  and  their 
corresponding  verifications.  However,  at  each  level  of  the  top-down  circuit  design  the  func¬ 
tionality  and  performance  of  the  system  cannot  be  guaranteed  unless  feedback  from  physical 
designs  is  taken  into  consideration. 

1.1.  Functional  and  Performance  Verification  of  a  VLSI  System 

In  verifying  a  VLSI  system  or  circuit,  two  critical  issues  must  be  emphasized: 

( 1  )  functional  verification  and 


(2)  performance  verification. 


functional  verification  can  be  defined  as  verifying  a  design's  functionality  to  ensure  that  the 


design  performs  the  function  as  originally  specified.  Different  levels  of  functional  verification 

,  -f 


ranging  from  the  behavior  level  to  the  circuit  level  are  usually  needed  during  the  top-down 


design  phase. 


£ 


9r 


In  each  level  of  the  functional  verification,  tor  the  purpose  of  testing  the  functionality  t 
a  design,  the  designer  should  provide  different  input  test  vectors.  Analysis  programs  such  as 
functional  or  logic  simulators  an  be  classified  in  this  tvpe  ot  verification  However  I  a  desicn 
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I  u'.'.i  ..  . .  npi.r'  r  :  !;r  vft.p  design 

. r  pose  :  :  re  per'  rmjn«.f  .  erihvdti<-n  l-.-r  a  \  !  SI  vp.:p  is  :<•  ensure  the  required 
,r;  0  'his  vh:p  usual!  v  ^pec  died  h.  the  designer  1  he  cpeed  at  x  huh  j  design  van 

;v:  iV  >  s':  re,.  dependent  r  '!ie  v.nderlvmg  process  technology  and  the  circuit  design  sf,  !e 
1  '  e  ■  v  ..  :  Vv  •ir.'  l  ig  .  cenerulf.  determines  the  inhetrnt  device  de!a\  .  w  hich  in  turn  plays  the 
tv '-t  .Ttp.'Man'  part  ;n  the  operating  speed  of  a  \  I  SI  system  There! ore.  the  correct  estimation 
t  . :  r  v  u  1 1  element  delays  including  both  active  am!  passive  devices  is  a  kev  factor  for  the  suc- 
■-  es  ;  Li  I  performance  verification  ol  a  chip  design 

Since  the  lower  level  implemenution  has  a  significant  impact  on  a  VLSI  system's  speed, 
'.tie  correct  abstraction  from  the  lower  level  to  a  higher  level  delay  model  is  imperative  for  the 
higher  levels  ot  performance  verification  If  a  design  synthesis  tool  such  as  a  silicon  compiler  is 
-diii/ed  to  directly  generate  the  physical  layout  of  the  entire  chip  from  the  behavioral  descrip¬ 
tion.  the  silicon  compiler  has  to  perform  timing  estimation  at  various  levels  of  the  design  in 
order  to  guarantee  the  speed  requirement  of  the  design.  Because  the  delay  concept  is  bottom-up 
and  the  silicon  compilation  is  top-down,  a  timing-driven  silicon  compiler  needs  feedback  from 
•he  lower  levels  *o  the  higher  level  synthesis  steps,  unlike  generating  the  correct  functionalitv 


t  a  design 


1.2.  Physical  Parameter  Extraction 

One  of  the  critical  issues  in  the  performance  verification  of  the  lowest  level  of  a  VLSI 
design  is  to  correctly  identify  both  transistors  and  interconnect  parasitics  in  the  physical  lay¬ 
outs.  This  process  is  known  as  circuit  extraction.  Several  programs  [ 2 - 1  ()]  have  been  imple¬ 
mented  to  emphasize  various  aspects  of  circuit  extraction  depending  on  the  verification  tool  to 
be  used  in  the  next  phase,  namely,  the  simulation  phase.  Basically,  three  types  of  information 
can  be  obtained  by  circuit  extraction: 

( 1 )  circuit  elements  such  as  MOS  devices,  resistance  and  capacitances 

(2)  electrical  parameters  associated  with  circuit  elements,  and 

(3)  connectivity  of  circuit  elements. 

Types  ( 1 )  and  (3)  are  enough  for  circuit  schematic  comparison  programs  [  1 1  ].  and  logic  simu¬ 
lators  [  1 2 ].  whereas  all  three  types  of  information  are  required  for  circuit  and  switch-level 
timing  simulators  [13-14], 

Previously,  only  active  devices  were  extracted  from  the  physical  layout  because  they 
were  sufficient  to  predict  the  circuit  performance.  But  in  the  present-day  device  technology 
with  submicron  feature  sizes,  interconnect  parasitics  can  no  longer  be  neglected  [15-17],  In 
order  to  be  able  to  accurately  predict  delays  through  signal  paths  in  VLSI  circuits,  resistances 
and  capacitances  associated  with  the  interconnects  should  also  be  included  in  the  output  of  cir¬ 
cuit  extraction.  Since  the  actual  value  of  interconnect  capacitances  and  resistances  is  strongly 
dependent  on  the  physical  layout,  it  becomes  important  to  find  accurate  models  for  intercon¬ 
nects  in  the  circuit  layout  in  order  to  correctly  estimate  the  effect  of  interconnect  parasitics  on 
VLSI  circuit  performance. 

The  most  difficult  task  in  modeling  interconnects  is  calculating  interconnect  resistances, 
self-capacitances  and  coupling  capacitances  between  conductors.  Recently,  some  techniques  [3- 


5  10.  1&-20]  have  been  developed  10  model  such  interconnect  parasitics  from  circuit  layouts. 
One  of  them  is  to  manipulate  layout  data  first  ( 1  <S ] .  then  use  a  process  simulator  [ 2 1  ]  to  calcu¬ 
late  parasitic  electrical  parameters  This  technique  can  simulate  the  effect  of  process  parameters 
on  interconnect  parasitics.  However,  it  is  very  expensive,  computationally,  to  simulate  the 
fabrication  process  for  VI. SI  circuits,  even  if  analytical  formulas  are  used  in  the  process  simu¬ 
lator  Another  technique  couples  the  library  look-up  method  and  the  numerical  method  such 
as  the  finite  difference  method  in  modeling  interconnects  [3],  In  this  technique,  the  first  step  is 
to  partition  layout  data  into  certain  configurations.  If  the  configuration  is  in  the  library,  then  it 
is  not  difficult  to  generate  interconnect  capacitance  or  resistance.  On  the  other  hand,  if  this 
configuration  has  not  been  encountered  before,  numerical  methods  need  to  be  applied  to  calcu¬ 
late  the  interconnect  parameters  This  new  configuration  can  then  be  added  to  the  library.  This 
technique  gives  quite  accurate  results  because  that  the  look-up  tables  are  built  by  using  accu¬ 
rate  numerical  methods  However,  it  is  still  lime-expensive  if  the  layout  configuration  is 
highly  irregular 

1.3.  Features  of  HPEX 

In  this  thesis,  in  order  to  demonstrate  the  detailed  extraction  of  interconnect  parasitics.  a 
hierarchical,  rectangle-based  circuit  extractor  called  HPFX  (Hierarchical  Parasitic  and  circuit 
L'Xlractor)  has  been  developed.  Features  of  HPFX  can  l>e  described  as  follows. 

1.3.1.  Manhattan  style  layouts 

Manhattan  layouts  described  in  the  C1F  [22]  layout  language  are  used  as  input  to  HPFX 
In  Manhattan  layouts,  all  boundary  segments  are  parallel  to  either  t-axis  or  y-axis.  There  are 
two  reasons  to  adopt  this  layout  style  in  our  study.  First,  the  data  structure  and  extraction 
algorithms  can  be  simplified  and  made  more  efficient  for  Manhattan  layouts  during  circuit 
extraction  as  compared  to  that  for  the  nonrestricled  layouts.  Second,  the  Manhattan  layout 


style  is  gradually  accepted  in  industry  due  to  its  simplicity  in  both  manual  and  automatic  lay¬ 
out  designs.  Therefore,  in  developing  an  experimental  extractor  such  as  HPEX.  the  Manhattan 
layout  style  is  a  good  assumption. 

1.3.2.  Analytical  formulas 

Analytical  formulas  for  resistance  and  capacitance  computations  along  with  a  rectangle 
decomposition  algorithm  are  employed  in  HPEX  to  model  interconnect  parasitics.  The  computa¬ 
tion  lime  can  be  reduced  substantially,  compared  to  using  numerical  methods  to  model  inter¬ 
connect  parasitics.  Although  some  accuracy  might  be  lost  with  analytical  formulas,  it  is  still  a 
good  trade-off.  considering  the  increasing  complexity  of  layout  designs. 

1.3.3.  Layout  resizing 

Layout  resizing  algorithms  are  applied  to  input  layouts  in  the  extraction  preprocessing 
phase  in  order  to  compensate  the  process  bias  such  as  feature  size  widening  due  to  lateral 
diffusion  or  shrinkage  in  polysilicon  lines  due  to  an  incomplete  photoresist  developing.  This 
consideration  for  extracting  real  conductor  feature  sizes  will  greatly  enhance  the  accuracy  of 
the  parasitic  modeling. 

1.3.4.  Separate  process  file 

The  process  file  for  HPEX  is  separated  from  the  execution  code.  A  different  process  file 
an  be  specified  without  recompiling  the  program.  Therefore,  users  can  easily  experiment  with 
many  different  process  files  in  order  to  optimize  the  design. 
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1.3.5.  Different  output  files 

Five  different  output  files,  as  shown  in  Figure  1.2.  can  be  generated  by  HPEX: 

[l]  Input  file  for  schematic  comparison  program  such  as  GEMINI  [  1 1  ]:  If  the  designer  wants 
to  make  sure  that  the  layout  has  the  same  schematic  as  the  original  design,  only  the 
transistor  list  needs  to  be  identified  and  fed  into  the  schematic  comparison  program  such 
as  GEMINI,  which  takes  the  extracted  and  designed  schematics  as  an  input,  then  uses  the 
graph  isomorphism  algorithm  to  test  if  these  two  schematics  are  the  same.  Some  layout 
mistakes  can  be  easily  detected  by  this  program. 


HPEX 


"  GEMINI  \ 
v  Input 

(  SPICE 

V  Input 

\  f  MOSTIM^N 
J  V,  ^Input 

(  Critical 

Schematic 

Comparison 

¥ 

Circuit 

Simulation 

'f 

Timing 

Simulation 

¥ 

Circuit 

Simulation 

Figu 

re  1.2 

Different  output  files  of  HPEX. 
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[2]  Input  file  for  switch-level  simulator  such  as  MOSTIM  [  14]:  If  only  rough  timing  infor¬ 
mation  is  required  or  a  big  circuit  needs  to  be  simulated,  the  extracted  circuit  should  be 
suitable  for  the  timing  simulator  or  switch-level  simulator  such  as  MOSTIM.  Since  most 
of  this  type  of  simulators  cannot  handle  interconnect  resistances,  the  resistance  extraction 
is  not  performed  for  this  output  file.  In  addition,  all  node  capacitances  including  intercon¬ 
nect  capacitances  and  device  nonlinear  capacitances  are  lumped  to  ground  in  the  capaci¬ 
tance  extraction  because  simple  device  models  are  generally  employed  in  the  timing  or 
switch-level  simulators. 

[3]  Input  file  for  circuit  simulator  such  as  SP1CE2  [  1 3]:  If  the  most  accurate  circuit  simula¬ 
tion  is  required,  the  extracted  output  file  should  include  all  detailed  interconnect  parasi- 
tics  such  as  self-  and  coupling  capacitances,  and  all  resistances.  The  nonlinear  capacitances 
associated  with  MOS  transistors  are  not  extracted  explicitly.  Since  SPICE  can  compute  the 
device  internodal  capacitances  for  given  device  dimensions,  only  dimensions  related  to 
device  capacitances  are  extracted.  For  example,  in  the  transistor  extraction  the  source  area 
and  perimeter  are  estimated. 

[4]  Input  file  containing  only  the  critical  path  information  for  SPICE2  :  A  circuit  simulator 
has  its  limitation  in  handling  a  circuit  with  a  large  number  of  transistors.  One  practical 
way  to  simulate  a  large  chip  is  to  simulate  the  critical  path  that  determines  the  chip  tim¬ 
ing.  Hv  having  user-specified  nodes  along  the  critical  path,  all  transistors  and  interconnect 
parasitics  in  this  path  can  be  extracted  by  HPEX.  Besides,  the  capacitive  loading  of  all  side 
branches  is  carefully  estimated  and  added  in  the  extracted  file. 

[■?]  Logic  block  description  :  In  this  output,  the  logic  function  of  each  gate  is  described  in 
terms  of  boolean  equation.  At  present,  this  type  of  extraction  works  only  for  conventional 
\M()S  and  CMOS  circuit  designs. 
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1.3.6.  Node  reduction 


An  accurate  node  reduction  technique  is  developed  and  employed  in  HPEX  to  reduce  the 
number  of  parasitic  elements  for  the  extracted  SPICE  input  file  This  will  reduce  the  computa¬ 
tion  time  in  the  SPICE  simulation.  In  addition,  the  designer  can  easily  pinpoint  the  interconnect 
parasilics  which  are  responsible  for  the  performance  degradation  if  this  degradation  is  indeed 
due  to  the  interconnection  delay. 


1.4.  Overview  of  Thesis 


As  mentioned  in  the  previous  section,  interconnect  modeling  for  VI .SI  circuits  is  a  must  in 
circuit  performance  verification  when  devices  with  small  feature  sizes  are  used  Modeling 
interconnect  parasitics  directly  from  a  circuit  layout,  however,  is  getting  difficult  in  terms  of 
accuracy  and  computation  lime.  In  order  to  study  the  accuracy  of  parasitic  modeling,  two- 
dimensional  empirical  models  for  interconnect  capacitance  and  resistance  are  first  studied.  Based 
on  these  models,  a  two-dimensional  program  called  FEMRC  is  developed  Since  the  equation 
formulation  for  FEMRC  is  based  on  the  finite-element  method,  there  are  no  shape  restrictions 
on  dielectric  interfaces  for  capacitance  calculation  Not  only  capacitance  but  also  resistance  can 
be  calculated  by  this  program.  In  resistance  calculation,  a  quasi-three-dimensional  effect  of  con¬ 
tact  resistance  is  also  taken  into  account.  Details  on  formulation  and  implementation  of  FEMRC' 
are  given  in  Chapter  2. 

In  Chapter  3.  a  layout  partition  technique  for  resistance  and  capacitance  modeling  similar 
to  the  one  proposed  by  Mori  and  Wilmore  in  [20]  is  studied.  By  applying  this  technique,  we 
use  analytical  formulas  fitted  from  numerical  data  rather  than  numerical  methods  to  model 
interconnect  parasitics  of  VLSI  circuits  in  order  to  compromise  between  the  accuracy  and  the 
computation  time.  By  carefully  fitting  data,  analytical  formulas  can  be  very  accurate,  espe¬ 
cially  when  the  interconnect  region  is  fairly  regular.  If  a  small  part  of  the  irregular  layout 
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region  requires  accurate  parasitic  computation,  we  suggest  using  two-  or  even  three-dimensional 
empirical  models  to  calculate  interconnect  parasitics.  Due  to  the  complicated  layout  design,  this 
extraction  methodology  compromising  between  efficiency  and  accuracy  may  be  the  best  way  to 
perform  circuit  extraction  and  to  verify  the  design 

The  data  structure  and  geometrical  algorithms  used  in  HPEX  are  discussed  in  Chapter  4. 
Ip.  order  to  extract  the  actual  feature  sizes  for  interconnect  modeling,  an  efficient  layout  resizing 
algorithm  based  on  a  scanline  approach  and  a  simple  rectangle  data  structure  is  also  presented 
in  Chapter  4  Following  this,  extraction  algorithms  used  in  11P1  \  and  details  on  their  imple¬ 
mentation  are  given  in  Chapter  5.  Finally,  a  new  node  reduction  technique  which  accurate!;, 
reduces  parasitic  A’C  networks  is  studied  'This  technique  is  capable  of  reducing  a  large  olume 
ot  interconnect  data  into  a  manageable  size,  thereby  substantially  reducing  the  effort  needed  in 
verification  The  theoretical  background  and  implementation  details  lor  the  node  reduction 
technique  can  Ire  found  in  Chapter  6.  F  inally,  conclusions  and  recommendations  on  luture 
research  topics  are  given  in  Chapter  7 
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CHAPTER  2. 


I  W  ODIMFNSIONAL  MODELS  FOR  INTERCONNECTS 


2.1.  Introduction 

Although  it  is  well  known  that  reduction  ol  circuit  dimensions  will  decrease  the  intercon¬ 
nect  capacitance  proportionately,  there  are  two  design  considerations  that  force  the  capacitance 
to  stay  high  The  (irst  consideration  is  that  designers  try  to  increase  the  chip  reliability  with 
respect  to  the  elec  immigration  effect  in  aluminum  conductors  bv  scaling  the  conductor  thick¬ 
ness  do  a,  n  slow  I  y  in  comparison  to  the  reduction  of  feature  size  I'he  second  consideration  is 
(hat  see  attempt  to  minimize  the  signal  delay  by  reducing  the  sheet  resistance  Since  the  sheet 
resistance  is  incerselv  proportional  to  the  conductor  thickness,  the  latter  thickness  is  also  scaled 
down  slo\c  !y  compared  to  the  chip  horizontal  scaling  As  a  result,  the  side-wall  fringing  capa¬ 
citance  ol  conductors  becomes  important  in  determining  the  overall  capacitance  It  has  been 
predicted  that  the  interconnection  delay  will  become  dominant  in  deciding  the  chip  operating 
speed,  especially  when  the  scale-down  of  device  feature  size  continues  [15],  In  order  to  accu¬ 
rately  predict  the  performance  of  VI  .SI  circuits,  the  capacitance  of  interconnect  lines  should  be 
carefully  modeled  Hecau.se  of  the  important  role  played  by  the  fringing  held  in  determining  the 
total  capacitance,  two  dimensional  or  even  three  dimensional  effects  should  be  taken  into 
account . 

As  mentioned  in  ('hapter  1.  not  only  the  capacitance  but  also  the  resistance  determines  the 
performance  of  \  LSI  circuits.  As  for  the  resistance  calculation,  several  methods  have  been 
reported  [2<>.  23- 2f>],  One  of  most  commonly  used  methods  >.ompules  the  resistances  of  rec¬ 
tangular  shaped  conductors  by  simple  counting  the  number  of  squares  in  the  conductor  and 
multiplying  it  by  the  sheet  resistance  When  using  this  method,  the  direction  of  current  flow 


must  he  estimated  before  calculating  the  length-to-width  ratio  of  conductors.  One  disadvan¬ 
tage  ot  this  method  is  its  limitation  to  the  acceptable  resistance  shape.  I  or  example,  the  resis¬ 
tance  shape  with  boundary  segments  not  parallel  with  x-axis  or  y-axis  cannot  be  correctly 
handled  by  this  method,  in  addition,  two-dimensional  effects,  such  as  current  crowding,  and 
contact  resistance  are  not  taken  into  account  by  this  method.  In  order  to  improve  resistance 
accuracy,  numerical  analyses  of  physical  equations  are  generally  applied.  These  numerical 
approaches  include  the  Schwarz-Christoffel  transformation  [25],  the  finite  difference  method, 
and  the  finite-element  method  (FEM)  [26],  In  this  chapter,  the  FFM  is  adopted  to  calculate  the 
interconnect  resistance.  The  proposed  resistor  model  can  handle  any  arbitrarily  shaped,  multi- 
region  and  multi-electrode  conductor  regions.  Furthermore,  by  using  a  different  physical  equa¬ 
tion  lor  contact  regions,  contact  resistances  can  be  incorporated  into  the  resistance  calculation. 

In  the  following  sections,  a  two-dimensional  mode!  for  calculating  capacitances  between 
multiple  conductors  is  first  described,  followed  by  a  discussion  of  two-dimensional  resistance 
models  Then  some  features  of  a  finite-element  program  called  FEMRC  are  given.  Finally,  some 
examples  ot  FFMRC  for  capacitance  and  resistance  calculation  are  illustrated. 

2.2.  Capacitance  of  Multiconductors 

2.2.1.  Integral  method 

The  integral  method  expresses  the  solution  to  the  Laplace  equation,  which  determines  the 
potentials  in  a  region  D,  by  the  following  form: 

<t>(r)  =  J (.i  (r  ,r'  )(T{r‘)dr  '  .  (2.1) 

D 

cchere  4><  r  )  is  the  potential,  cr(r')  denotes  the  charge  density  at  point  r'.  and  G(r.r')  is  an 


appropriate  Creen  fun  lion  that  describes  the  potential  at  r  induced  by  a  unit  charge  at  r\ 
\ithough  the  indicated  integration  covers  the  entire  region  D.  it  needs  in  fact  to  be  performed 
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only  where  the  charge  density  cr(r')  is  not  zero.  For  an  N-conductor  system,  if  the  potential  at 
the  surface  of  each  conductor  is  specified,  the  only  unknowns  left  in  Eq.  (2.1)  are  the  charge 
densities.  By  using  the  method  of  moments  and  a  set  of  basis  and  testing  functions  [27].  charge 
densities  in  Eq.  (2.1)  can  be  solved  by  a  system  of  linear  equations. 

Once  charge  densities  are  obtained  on  the  surface  of  all  conductors  for  different  combina¬ 
tions  of  conductor  potentials,  the  capacitance  coefficients  can  be  easily  computed.  Consider  a 
three-conductor  problem  shown  in  Figure  2.1.  The  capacitance  coefficients  can  be  defined  as 

<2 1  =  C, ,«>,  +  C12(4>,-<t.2)  +  CI3(*  -4>3) 
q2  =  c2i(<t>2-<t>l)  +  C22<X>2  +  C23($2—<t>3) 

Q3  —  C‘31(<I>3  +  Cj2(<1)3—<1>2^  C 33*^3  (2.2) 

where  Q ,  is  the  total  charge  on  conductor  i.  is  the  potential  of  conductor  j .  and  C(y  denotes 

the  capacitance  coefficients.  By  applying  even  and  odd  mode  potential  patterns.  C,y  in  Eq  (2.2) 

can  be  easily  determined 


GROUND  PLANE 


Figure  2.1  A  three-conductor  system. 


14 


One  of  the  critical  tasks  in  using  integral  method  for  capacitance  calculation  is  to  derive 
an  appropriate  Green's  function  for  different  dielectric  regions.  Two  different  methods  for 
deriving  Green  s  function  have  been  proposed:  (1)  the  method  of  images  [28],  and  (2)  spectral 
analysis  [29].  However,  these  two  methods  are  only  practical  for  dielectric  regions  with  planar 
interfaces.  The  Green  function  for  non-planar  dielectrics  is  too  complicated  to  obtain  expli¬ 
citly.  This  is  one  of  the  disadvantages  in  applying  the  integral  method  to  compute  the  capaci¬ 
tance.  But  from  the  computer  time  and  storage  point  of  view,  the  integral  method  is  still  a 
good  and  acceptable  means  to  compute  the  interconnect  capacitance.  A  typical  program  which 
uses  this  method  to  calculate  capacitance  is  CAP2D  [27], 

2.2.2.  Finite-element  method 

Instead  of  solving  the  charge  densities,  the  differential  method  computes  the  potentials 
directly  by  solving  the  following  Laplace  equation  over  a  region  12  subject  to  some  boundary 
conditions. 

VeV<I>(.*  .y  )  =  0  (x  ,y  )  €  O  (2.3) 

Boundary  condition  (BC)  has  to  be  specified  at  every  point  of  the  boundary  in  order  to  obtain  a 

unique  solution  of  the  above  equation.  Boundary  conditions  can  be  further  classified  into  two 

principal  types  for  the  Laplace  equation: 

(  1  )  Dinchlet  (nr  Essential  )  BC  An  equation  relating  the  values  of  <t>  at  points  of  the  boun¬ 
dary 

(  2)  Neumann  (nr  Natural  )  BC  :  An  equation  relating  the  values  of  first  derivative  cl  <I>  (VtO 
at  points  of  the  boundary 

Two  methods  ^an  be  employed  to  numerically  solve  the  Laplace  equation:  (1  )  finite-difference 
method  if  DM*,  and  (2)  finite-element  method  (FLM).  The  approach  we  describe  here  is  tfie 


dielectrics. 

Two  methods  are  generally  used  to  formulate  finite-element  equations:  (1)  minimization 
of  the  functional,  and  (2)  weighted  residual  process  Since  the  functional  of  the  Laplace  equa¬ 
tion  is  explicit,  it  is  convenient  to  apply  the  minimization  of  the  functional  to  derive  finite- 
element  equations.  The  problem  to  solve  Fq.  (2.3)  is  then  equivalent  to  finding  <J>.  which 
satisfies  boundary  conditions  and  minimizes 

2  *> 

C  r  1  d<l>  d®  ,  x 

\  =  f  I  — €  -  +  -  dxdx  .  (2.4) 

Q  2  e*  ay 

where  (1  is  the  region  of  interest.  If  the  region  of  interest  is  discretized  into  A/  grid  points,  the 
approximate  potential  in  this  region  can  be  expressed  as 

v 

<P(  x  .V  )  =  .V  )</>,.  (2.5) 

where  M,  is  the  shape  function  (or  the  interpolation  function),  and  c6,  is  the  potential  at  node  i 
The  shape  function  \\  has  the  following  property 

0  i  *  j 

A'  (.x  ,y  )  =  (2.6) 

' ;  i  1  =  ; 

For  a  linear  triangular  element  shown  in  Figure  2.2.  the  shape  function  can  Ire  derived  as 


N ,{x  ,v  )  = 


(al  +b,x  +cly  ) 


where  the  area  A  is  given  bv 


1  *i  >'i 


A  =  —  '  1  .T  -*  V, 

2  1  ‘  ‘ 
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and  the  constants  a  .  b  .  and  c  are  riven  in  terms  of  nodal  coordinates  bv 


Figure  2.2  A  triangular  element. 


ai  =  *2.V3-*jV2 
b\  =  3 

c j  =  .t 3— x 2  .  (2.9) 

with  the  others  being  obtained  by  cyclic  permutation.  Let  the  unknown  potential  <J>  be  defined 

element  by  element;  the  element  contribution  could  then  be  evaluated  using  Eq  (2.4).  For  any 

node  t .  t  =  1.2.3.  by  differentiating  Eq.  (2.4).  we  can  write 

d*  _  r  r  d®  d  cH*  6^  d  d® 

-  —  I  € - - + - -  dxd  V  (2.10) 

&*>,  a*  a*,  dx  a>-  d4>,  ay 

Substituting  Eq  (2.5)  into  Eq.  (2.10).  We  have  the  following  matrix  equation  for  a  triangular 
element 

=  f/i  r  i*r .  (2.1D 

a<*r 

w  here  the  element  (or  stiff )  matrix  [,'1  T  for  a  linear  triangular  element  is  given  bv 


u>  r  = 


e 


(2  12) 


it 


6,6,+CjC;  blb2+clc2 
b  -,b  |  "f  c  ->c  |  b  ~tb  -» "4"C  ->c  -> 
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16  3/>  j +c  3c  (  b2b-,+c  3c  2 

may  be  noted  that  the  element  matrix  is  symmetric. 


6^3+c.c3 
b  -ib  ^  ^  c  ~)C 

bib3+CiCl 


After  assembling  a  whole  set  of  minimization  equations  and  imposing  suitable  boundary 
conditions  such  as  conductor  potentials,  we  can  solve  the  lollowong  set  of  linear  equations  to 
find  the  potential  at  each  grid  point: 

[//]!<*}  =  !*>}  (2.13) 

In  the  above  equation,  the  right  hand  vector  {61  is  nonzero  because  of  Dirichlet  boundary  con¬ 
ditions  being  1  ('reed.  In  this  ^ase.  me  Dirichlet  boundary  condition  is  the  conductor  voltage  In 
order  to  get  a  better  accurac.  in  11M  a  typical  global  matrix  [// ]  which  contains  a  large 
number  ill  equations  is  used  However  in  general  most  of  the  terms  in  the  matrix  [//  ]  are  zero, 
there!  re.  sparse  matrix  techniques  such  as  minimum  liil-in  reordering  and  11  factorization 
are  employ  ed  to  sole  e  f-q  <2  13  <  in  order  to  save  memory  space  and  CPI  time. 

Once  potentials  at  grid  points  are  obtained  by  I  q  <2  13).  the  electric  field  within  each 
linear  triangular  element  can  be  computed  as  follows: 

-i  1  (  _  _! 

I.  =—  V<t>  = -  f b  ,d>,  +b  ,<t>^+b  (d>, j.v  <c  ,(£, +c ,<£,+<:  ,6v)v  .  (2  14) 

2A  ‘  ‘  '  ’ 

w  nere  i  and  y  are  unit  vectors  in  the  x  and  v  -directions,  respectiv  ely  By  integrating  the 
electric  held  around  the  conductors  and  applying  the  (iauss  law.  the  charge  on  each  conductor  is 
giv  en  by 


Q  -  f  e/7  J? 

( 

( ’■  >nsidering  an  \-conductor  sy  stem,  the  capacitance  matrix  can  be  solved  by 


(2.15) 
the  following 


matrix  equation 


taj 


[C]  =  [<2][VP.  (2.16) 

w  here  [V]  is  the  voltage  excitation  matrix  which  contains  N  independent  excitations,  and  [Q]  is 

the  corresponding  charge  matrix. 

Since  it  is  impossible  to  discretize  the  region  with  open  boundary  in  the  FLM.  a  closed 
region  is  used  to  approximate  the  infinite  region.  As  a  consequence,  some  accuracy  may  be  lost 
due  to  this  approximation.  However,  by  carefully  choosing  the  region  of  importance,  we  still 
can  maintain  the  accuracy  of  results.  For  example,  in  capacitance  calculation  the  closed  boun¬ 
dary  is  chosen  such  that  the  electric  field  on  the  boundary  should  be  negligible. 

As  described  before,  unlike  the  integral  method,  the  finite-element  method  is  not  res¬ 
tricted  to  the  geometric  configuration  of  the  insulators  and  conductors.  This  has  an  important 
implication  in  the  present  multilayered  interconnect  technology  because  the  true  structure  of 
conductors  and  surrounding  insulating  layers  can  be  simulated.  Therefore,  in  a  VLSI  chip  with 
small  feature  sizes,  the  finite-element  method  seems  to  be  a  better  approach  in  determining  the 
interconnect  capacitance. 


2.3.  Resistance  Calculation 


2.3.1.  Conductor  region  "without  contact  windows 


Consider  an  interconnect  region  H  with  no  contact  windows  as  shown  in  Figure  2.3.  Let 
the  electrodes  be  around  the  boundary  of  this  region.  The  governing  equation  can  be  described 
by  the  following  Laplace  equation: 


V  —  V4>(x  .y  )  =  0  U  ._>')€« 

Rs 


(2.17) 


s  \ 


I 


Boundary  Conditions: 

<P(.x  .v  )  ~  Constant  (.v  .v  )  €  Electrode  ( 1  )  and  Electrode  (2) 

V'DC*  .y  )  =  0  (x  .y  )  £  Electrode  ( 1 )  and  Electrode  (2) 
where  Rs  is  the  sheet  resistance.  Since  Lq.  (2.17)  has  the  same  form  as  Eq  (2.3).  two- 

dimensional  resistance  calculation  in  this  case  follows  the  same  procedures  described  in  the 

1 

preceding  section.  The  onlv  difference  is  that  the  permittivity  €  is  replaced  bv  — .  The  element 

R, 

matrix  for  a  linear  triangular  element  in  resistance  calculation  is  given  by 

b1bl+clcl  bib2+clc2  6,6,+c,c, 

,  ,,  1 

l  b  ]  b  2  b  2^~c  2c  2  b  ib  ^^c  2c  (  2  1 N ) 

b3b1+c3ci  b3b2+c3c-,  bfi3+c3c3 

After  the  global  matrix  is  assembled  from  all  element  matrices  by  incorporating  the  boun¬ 
dary  conditions,  the  potentials  at  grid  points  can  be  solved.  From  the  potential  information, 
the  current  flow  ing  out  or  into  each  electrode  can  be  easily  calculated  by 


/.  =  fl  dT  = - f  ?<t>  ds 

'  R.  , 


(2  i<> ' 


-»  ».)  t.«  lt»  l«»  '.t 


!*»-  >*•  »>«  >i.  iu  j>.  *1.  fci.  iVH,  |>.  >1.  iu  aCUfa 


where  T  is  a  line  segment  along  the  electrode  E .  Because  a  linear  interpolation  potential  func¬ 
tion  has  been  assumed  inside  a  linear  triangular  element,  the  current  density  in  a  triangular 
region  is  a  constant. 

Consider  an  interconnect  region  with  N  electrodes.  The  resistances  uetween  N  electrodes 
can  be  calculated  by  repeatedly  applying  tne  following  procedures: 


( 1 )  For  electrode  i .  set  <!>,  =  1  and  <J>;  =  0  ~V  j 


<  2  )  ( )btain  potential  solution  and  calculate  the  electrode  current  . 


( 3  )  Resistance  R  is  then  computed  by  the  following  equation: 


(2.20) 


N(A/  +  l) 


*>,  =  ~  =  ~ 

Repeat  the  above  procedures  for  i  from  1  to  \.  a  complete  resistance  network  with 
resistors  can  be  obtained. 


2.3.2.  Conductor  region  with  contact  windows 


Due  to  a  large  amount  of  contacts  required  in  a  VLSI  chip  design,  the  metal-to-diffusion 
and  metal-to-polvsilicon  contact  resistances  inevitably  play  an  important  role  in  determining 
the  total  interconnect  resistance.  In  this  section,  we  will  take  the  contact  windows  into  the 
finite-element  equation  formulation  in  order  to  calculate  the  interconnect  resistance  including 
the  contact  resistance.  Consider  a  diffusion  region  with  boundary  and  contact  window  elec¬ 
trodes  as  shown  in  Figure  2.4.  Assume  that  the  contact  regions  are  covered  by  metal  and  the 
metal  voltage  is  constant.  For  this  case,  the  two-dimensional  field  problem  can  be  described  [30] 
as  follows: 

(  1  i  Outside  the  contact  window,  the  potential  must  satisfy  the  Laplace  equation,  namely, 
liquation  (2.17). 


»•,*•*-* 


(2)  Inside  the  contact  window,  the  potential  is  governed  by 


1  1 

V  — yij)  =  —  . 

R. 


(2.21) 


which  is  the  Helmholtz  equation,  where  pc  is  the  specific  contact  resistivity  and  is  the 


voltage  on  the  contact  metal.  The  specific  contact  resistivity  pc  is  defined  as 


!<y  (*•„„> 


(2.22) 


/here  vmt  is  the  potential  difference  between  the  metal  Fermi  potential  and  the  semicon¬ 


ductor  potential.  For  each  process.  pc  is  unique  and  can  be  determined  by  a  certain  test 


structure. 


The  case  of  calculating  resistances  for  the  regions  outside  the  contact  window  has  already 


been  discussed  in  the  previous  section.  In  this  section  we  will  concentrate  on  formulating  the 


finite-element  equations  in  the  region  within  the  contact  window.  It  has  been  shown  that  the 
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functional  of  the  Helmholtz  equation  is 
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dxd\ 


(2.23) 


If  again  the  linear  triangular  element  is  chosen,  by  differentiating  the  elemental  functional  with 
respect  to  nodal  potentials,  we  ha';e  the  following  matrix  equation: 

dx 


dW 


-  =  [/>r{0r  +  ifr. 


(2.24) 


where 


[hr  = 


6  j  "fc  jC  j  j&2 9  6  1^3^^  3 

2  1  1 

i 

A 

6-*&j4*C2Cj  b  2&2+c->c->  b->b^+C2C^ 

1  2  1 

12  pt 

b  ^b  j  “4“ c  j  b  -yb  <)  1  c  ->  b  -^b  ^  *1" c  j 

1  1  2 

(2.25) 


and 


I F  l*  =  — 


3  P, 


(2.26) 


Consider  a  conductor  with  both  noncontact  and  contact  regions.  The  global  matrix  equation 
associated  with  this  conductor  can  be  assembled  on  an  element-bv-element  basis.  In  a  matrix 
assembling  process,  two  different  types  of  element  matrices  should  be  applied  depending  on  the 
location  of  the  element.  Equation  (2.18)  is  used  for  a  noncontact  element,  while  both  Eqs. 
(2.25)  and  (2.26)  are  applied  for  a  contact  element.  Using  the  same  procedure  as  described  in 
the  previous  section,  a  complete  resistance  network  corresponding  to  the  region  of  interest  can 
be  obtained.  It  is  noted  that  the  current  flowing  into  a  contact  window  can  be  calculated  either 
by  integrating  the  current  density  around  the  contact  window  or  by  summing  the  current  con¬ 
tributed  by  all  the  elements  inside  the  contact  region.  Using  the  latter  method,  we  can  derive 
the  current  flowing  into  a  contact  in  a  closed  form: 
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(2.27) 


where  e,  is  the  region  of  the  triangular  element  i . 


2.4.  Finite-Element  Program  :  FEMRC 


A  finite-element  program  FEMRC.  which  implements  the  previously  described  capacitance 
and  resistance  models,  has  been  developed.  The  main  flowchart  showing  the  capacitance  and 
resistance  calculation  phases  is  illustrated  in  Figure  2.5  The  grid  generator  1GG12  from  the  dev¬ 
ice  simulator  P1SCES-IIB  [3l]  is  employed  in  FEMRC  to  automate  the  preliminary  triangulation 
phase  by  the  user-specified  boundary  points  and  regions. 


The  main  features  of  FEMRC  are 


1 )  Due  to  the  use  of  1GG12  the  users  only  have  to  provide  a  small  amount  of  input  data  com¬ 
pared  to  the  massive  amount  of  data  for  all  triangular  elements  required  in  computation. 


(2)  Having  different  options.  FEMRC  can  simulate  both  two-dimensional  capacitances  and 
resistances. 


(3)  In  capacitance  calculation,  nonplanar  dielectrics  are  easily  handled.  There  is  no  limitation 
on  the  number  of  conductors  or  dielectrics. 


(4)  In  the  resistance  calculation,  any  arbitrarily  shaped,  mulliregion  and  multielectrode  resis¬ 
tor  can  be  simulated. 
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2-5.  Examples  and  Discussion 


Table  2  1  Test  structures  and  different  passivations 
for  FEMRC  calculations. 


test  structures 


case 


pitch 


linewidlh  \V 


(/x m  ) 


( fJ.m  ) 


spacing  S 


{fj-m  ) 


(1) 


1.7 


1.3 


(2) 


1.8 


2.2 


(3) 


2.3 


2.7 


(4) 


2.8 


2.2 


passivations 


(A) 


\o  passivation  =  1 ). 


(B) 


1  tim  thick  SiQ2  =  3-9). 


(C) 


1  fwi  thick  nitride  IJr  =  7.0). 


(D) 


1  /xm  thick  SiO-,  +  1  fim  thick  nitride  on  the  top. 


('  P  Yuan  and  T  N  Trick.  ICTAD-S4.  rP.  263-265 


Table  2.2  Percentage  errors  of  self-capacitances  calculated  by 
FEMRC  and  CAP2D  for  the  structures  listed  in  Table  2.1*. 


*  Assume  planar  dielectric  interfaces  for 
both  FEMRC  and  CAP2D  calculations. 

Percentage  errors  calculated  by  CAP2D  are  taken  from 
C.  P.  Yuan  and  T  Trick.  ICCAD-84,  pp.  263-265. 


Table  2.3  Percentage  errors  of  coupling  capacitances  calculated  by 
FEMRC  and  CAP2D  for  the  structures  listed  in  Table  2.1*. 


passivation 

case  ( 1 ) 

case  (2) 

case  (3) 

case  ( 4 ) 

cm 

0.0205 

0.0130 

0.0107 

0.0145 

C, (FEMRC) 

0.0172 

0.0097 

0.0075 

0.0104 

%  (FEMRC) 

-16 

-25 

-30 

-28 

WMlXUSTAm 

-S.8 

-19 

-21 

-23 

Cm 

0.030 

0.031 

C.  ( FEMRC ) 

0.0485 

0.0291 

0.0294 

%  (FEMRC) 

-1.0 

-3.0 

-4.2 

-5.2 

3.9 

2.0 

2.9 

1.9 

fl 

_ c* _ 

0.039 

0.030 

0.040 

0.0814 

0.050 

■H 

%  (FEMRC) 

17 

28 

33 

25 

%  (CAP2D) 

21 

33 

40 

33 

(D) 

.  cm . 

0.050 

0.0345 

0.0268 

0.0362 

C,  (FEMRC ) 

0.0529 

0.0341 

0.0287 

0.0357 

5.8 

1.2 

7.1 

%  (CAP 2D  ) 

9.6 

2.6 

13 

SBBS— 

^Assume  planar  dielectric  interfaces  for 
both  FEMRC  and  CAP2D  calculations. 

Percentage  errors  calculated  by  CAP2D  are  taken  from 
C.  P.  Yuan  and  T.  N.  Trick.  1CCAD-84.  pp.  263-265. 


from  [32].  Percentage  errors  for  both  FFMRC  and  CAP2I)  by  using  the  measured  data  as  the 
reference  are  computed  as  follows: 

C  -C 

s  'tl 

Percentage  error  ( 9c )  =  1(X)X - 

c,„ 

By  comparing  the  results,  it  is  observed  that  percentage  errors  for  self-  and  coupling  capaci¬ 
tance  calculations  are  comparable  for  FFMRC  and  CAP2I)  if  the  measured  data  are  used  as  the 
reference.  The  percentage  errors  for  self-capacitance  are  all  within  109  However,  the  percen¬ 
tage  errors  for  coupling  capacitance  are  consistently  greater,  especially  in  passivations  (A)  and 
1  C).  T  h is  may  be  due  to  the  ;  act  that  coupling  capacitance  is  highly  sensitic  e  to  the  variation  of 


conductor  spacing  and  dielectric  between  conductors.  Besides,  it  should  be  noted  that  the  meas 


ured  data  on  a  physical  model  of  a  structure  cannot  be  regarded  as  exact 

In  order  to  take  a  step  coverage  of  passivation  into  account,  a  piecew  ise  line  is  used  in  the 
second  set  of  the  simulations  to  approximate  the  nonplanar  dielectric  interface  in  the  passiva¬ 
tion  layer.  The  simulation  results  along  with  the  measured  data  are  listed  in  Tables  2  4  and 
2.5.  It  should  be  noted  that  passivation  (  A)  is  not  included  in  the  tables  because  no  dielectric 
covers  the  conductors  in  this  case.  By  comparison,  we  can  see  that  the  percentage  error  for 
self-capacitance  stays  almost  the  same.  However,  the  results  for  the  coupling  capacitance  have 
been  improved  on  the  average,  for  example,  in  the  case  of  passivations  (C)  and  (D).  From  this 
observation,  it  is  indicated  that  if  we  want  to  improve  the  simulation  accuracy  by  taking  non¬ 
planar  dielectric  into  account.  FFMRC  is  an  indispensable  tool. 


Table  2.4  Percentage  errors  of  sell -capacitances  calculated  by 
FFMRC  for  the  structures  listed  in  Table  2.1r. 


F 

assivation 

case  (2) 

case  ( 3 ) 

case (4)1 

c.„ 

0.071 

0.088 

0.091 

(B) 

C\  ( FFMRC  ) 

0.056 

0.0679 

0.082  7 

0.0874  i 

%  (  FFMRC  ) 

1.8 

■ 

“Z7i 

-4.0  i 

0.076 

0.094 

0096  ! 

((') 

Cs  i  FFMRC ) 

0.0567 

00702 

0.0860 

00897  1 

T  ( FFMRC  ) 

-2.2 

-7.6 

-8.5 

-6.6  : 

0.057 

0.074 

0.091 

0.094  [ 

( I) ) 

C,  ( FFMRC  ) 

0.0577 

0.0694 

00842 

0 0888  : 

T  ( FFMRC  ) 

1.2 

-7.5 

1 

t 


Assume  nonplanar  dielectric  interfaces  for  FFMRC  calculations. 


Table  2.5  Percentage  errors  of  coupling  capacitances  calculated  by 
FFMRC  for  the  structures  listed  in  Table  2.1  . 


passivation 


case(l)  !  case (2)  I  case (3)  i  case (4) 


C  ( FFMRC  )  0.0471 

%  ( FFMRC  )  -3. 


C  (FHMRC)  0.0778 


( FFMRC  ) 


(1))  C A FFMRC)] 00521 


(FFMRC) 


0.0251 


0.039 


0.0345 


0.0333 


00195 


9 


0.030 


0.0268 


0.0277 


0.040 


0.0414 


3.5 


0.0362 


0.0354 


Assume  nonplanar  dielectric  interfaces  for  FHMRC  calculations. 


Example  2: 


In  this  example,  two  conductors  with  slant  sidewalls  situated  on  the  top  of  oxide,  as- 
shown  in  Figure  2.7.  are  simulated.  From  a  cross-sectional  view  of  this  structure,  it  can  be  seen 
that  the  conductors  are  first  covered  by  a  layer  of  nitride  with  0.35/am  thickness,  then  another 
thick  layer  ol  polyimide.  Simulation  results  for  conductors  w'ilh  spacing  ranging  from  1.0 fim 
to  5.5/j.m  are  listed  in  Table  2.6.  The  capacitance  Ct  is  the  fringing  capacitance  between  the 
conductor  edge  and  ground,  and  C(  is  the  coupling  capacitance  between  two  conductors.  In 
I  able  2.6.  calculated  capacitances  for  both  nonplanar  and  planar  dielectric  cases  are  both  listed 
lor  comparison.  It  is  observed  that  the  nonplanar  dielectric  interface  ;n  general  has  greater 
impact  (>n  the  coupling  capacitance  than  on  the  fringing  capacitance  This  is  because  the  non¬ 
planar  dielectric  interlace  significantly  changes  the  equivalent  dielectric  constant  between  two 
conductors  if  there  is  a  large  difference  between  the  dielectric  constants  of  the  two  lavers  of 


passivations.  W  ithout  considering  the  nonplanar  dielectric  effect,  we  obtain  an  average  15r 
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Figure  2  7  Cross-sectional  view  of  two  conductors 
with  a  nonplanar  dielectric  interface 


error  in  ihe  fringing  capacitance  calculation.  However,  the  percentage  error  is  increased  to  259c 
for  the  case  of  the  coupling  capacitance  calculation. 

Example  3: 


Figure  2.8  shows  a  diffusion  region  with  one  contact  electrode  and  one  boundary  electrode. 
Assume  that  the  sheet  resistance  is  12.4fi  and  the  specific  contact  resistivity  is  24.3  Q/zm\  The 
calculated  resistance  between  the  two  electrodes  is  1 9.3 iz  including  the  contact  resistance.  For 
comparison,  we  replace  the  contact  electrode  by  a  boundary  electrode  around  the  contact  region 
and  perform  the  simulation  again.  The  calculated  total  resistance  is  10.8fl  for  this  case.  An 
8.5  Q  difference  is  due  to  the  effect  of  contact  resistance.  If  a  circuit  layout  has  a  large  number 
ol  such  contacts,  neglecting  them  in  circuit  extraction  may  cause  an  incorrect  prediction  of  cir¬ 
cuit  performance  in  the  next  step  of  verification. 


Example  4: 


In  this  example,  we  use  a  square  coaxial  transmission  line  [25]  shown  in  Figure  2.9  to 
evaluate  the  calculated  accuracy.  The  exact  resistance  of  this  structure  is  known  to  be 
1. 287095144ft  if  the  sheet  resistance  is  1  ft.  Table  2.7  shows  the  percentage  errors  and  the  cal¬ 
culation  times  for  different  numbers  of  grid  points  chosen  in  FEMRC.  The  calculation  time 
includes  grid  generation,  triangulation,  matrix  solving  and  other  setup  times.  It  is  observed  that 
increasing  the  number  of  triangular  elements  improves  the  calculated  accuracy,  just  as  theoreti¬ 
cally  predicted  But  the  calculation  time  is  significantly  increased  if  a  large  number  of  grid 
points  are  used  in  the  calculation.  For  the  practical  application,  however,  a  trade-off  between 
the  calculated  accuracy  and  the  computation  time  should  be  made  in  order  to  save  the  computer 


resources. 


Table  2  7  Runtimes  and  percentage  errors  of  resistance  calculations 
for  the  conductor  shown  in  Figure  2.9 
with  different  triangulalions. 


resistor  can  be  simulated.  In  addition,  contact  resistance  is  modeled  by  the  Helmholt2  equation, 
in  which  quasi-three-dimensional  current  flow  is  taken  into  consideration.  Experience  with 
FEMRC  shows  that  it  is  a  practical  useful  tool  in  VLSI  performance  verification. 

At  the  present  time.  FEMRC  is  not  incorporated  into  the  layout  extraction  program  HPEX 
However,  it  generates  the  reference  values  for  analytical  capacitance  and  resistance  models 
employed  in  HPEX.  Details  for  the  analytical  models  for  capacitance  and  resistance  will  be  dis¬ 


cussed  in  the  next  chapter. 


CHAPTER  3. 


RESISTOR  AND  CAPACITOR  MODELS  IN  HPEX 

3.1.  Introduction 

Several  methods  have  been  developed  to  calculate  distributed  resistances  and  capacitances 
of  interconnection  lines  in  a  circuit  extraction  program.  In  general,  these  methods  can  be 
classified  into  the  following  three  categories:  (1)  numerical  methods  [33-34.24],  (2)  table- 
look-up  methods  [3],  and  (3)  analytical  formulas  [19-20. 35]. 

Numerical  methods,  as  described  in  the  previous  chapter,  are  used  to  compute  capacitances 
and  resistances  by  discretizing  the  field  equation  in  the  space  or  the  surface  of  the  conductor  of 
interest.  Since  the  accuracy  of  the  computation  strongly  depends  on  the  number  of  grid  points, 
a  large  number  of  grid  points  generally  needs  to  be  generated  in  order  to  obtain  a  reasonably 
satisfactory  result-  This  in  turn  implies  that  a  large  number  of  linear  equations  have  to  be 
solved.  Therefore,  numerical  methods  often  require  a  large  amount  of  computer  memory  and 
time.  Hence,  although  they  are  able  to  model  the  real  shapes  of  conductors  easily  and  very 
accurately,  numerical  methods  are  impractical  for  the  majority  of  parasitic  extractions  of  VLSI 
circuit  layouts  from  the  computational  standpoint.  One  practical  application  of  the  numerical 
methods  is  that  they  can  be  used  to  model  a  small  number  of  irregular  layouts,  while  allowing 
more  regular  shaped  conductors  to  be  handled  by  other  simpler  models. 

The  table-look-up  method  and  analytical  formula  approach  are  simpler  in  comparison  to 
the  numerical  model.  Table-look-up  methods  are  fairly  efficient,  but  constructing  tables  for  all 
possible  layout  configurations  would  be  time-consuming.  Therefore,  analytical  formulas  are 
employed  in  our  resistance  and  capacitance  computations  for  simplicity  and  efficiency.  Some 
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accuracy  may  be  lost  if  analytical  formulas  are  adopted.  However  by  carefully  fitting  two-  or 
even  three-dimensional  numerical  data,  analytical  formulas  still  produce  results  with  good 
accuracy  Furthermore,  since  only  Manhattan  layouts  are  allowed  as  the  input  to  our  circuit 
extractor.  derivation  of  analytical  formulas  with  reasonable  accuracy  is  much  easier  Our  goal 
is.  therefore,  to  find  accurate  analytical  formulas  for  resistances  and  capacitances  in  actual  cir¬ 
cuit  layouts  and  use  them  in  the  circuit  extractor  In  this  manner,  interconnect  parasitics  can  be 
extracted  in  a  reasonable  amount  of  CPU  time,  as  well  as  being  modeled  with  reasonable  accu- 
rac  v 


3.2.  Resistor  Models  Used  in  the  Extractor 


3.2.1.  Long,  straight  interconnect  conductors 


Since  our  extractor  handles  only  Manhattan  circuit  layouts,  simple  formulas  have  been 
employed  to  compute  the  resistances  of  nets  For  long,  straight  conductor  lines,  the  well- 
known  one-dimensional  resistance  formula  given  below  is  used. 


R 


=  R. 


length 


(3.1 ) 


width 

where  R is  the  sheet  resistance.  This  means  that  the  value  of  resistance  in  a  long  conductor 
line  is  the  sheet  resistance  times  the  total  number  of  squares  in  the  line.  When  rectangular  con¬ 
ductors  of  the  same  material  with  various  widths  are  cascaded,  as  showm  in  Figure  3.1.  the 
total  resistance  is  approximated  by  the  following  equation: 
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This  is  only  an  approximation,  since  the  current  crowding  effect  around  some  corners  of  rectan¬ 
gles  has  been  neglected.  The  accuracy  of  this  approximation  is  hard  to  predict,  since  it  varies 
with  the  actual  layout  implementation.  For  example,  the  actual  resistance  of  the  layout  shown 


3.2.2.  Right-angle  bend  in  the  conductors 


An  important  shape  that  we  have  taken  into  account  is  the  right-angle  bend,  as  shown  in 
Figure  3  3  in  the  conductors  If  the  corner  rectangle  is  a  square,  the  resistance  of  the  corner  rec¬ 
tangle  is  less  than  that  computed  by  using  the  center  line  as  the  length  due  to  current  crowding 
The  two-dimensional  analytic  formula  we  use  to  approximate  the  resistance  of  the  corner  rec¬ 
tangle  is  [3b] 
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w  here  a  is  the  ratio  of  wide-to-narrow  widths  of  the  corner  rectangle.  In  Figure  3  3.  a  =  — 

VV 

w  ith  the  assumption  that  w,  ^  w,.  It  should  be  noted  that  il  the  corner  rectangle  is  a  square, 
i  e  .  a  =1  the  resistance  R  is  0  559R%h  instead  of  Rfh  computed  by  Fquation  (2  1) 


3.2.3.  Irregular  shapes  of  conductors 

For  other  more  complicated  shapes  in  interconnects,  the  Laplace  equation  must  be  solved 
in  order  to  determine  their  resistances.  However,  this  is  not  done  in  our  extractor  in  order  to 
save  CPU  time.  Instead,  we  use  a  rectangle  partitioning  algorithm  mentioned  in  the  previous 
chapter  to  estimate  resistances  of  some  irregular  shapes.  For  instance.  Figure  3.4(a)  show's  an 
actual  interconnect  layout.  After  partitioning  one  rectangle,  a  new1  node  number  must  be 
inserted,  and  the  result  is  shown  in  Figure  3.4(b).  The  resistance  of  the  rectangle,  which  abuts 
the  other  three  rectangles,  is  approximated  by  half  of  the  resistance  determined  by  the  above- 
mentioned  formula  If  the  Laplace  equation  is  solved  for  this  configuration,  we  have  the  more 
accurate  result  shown  in  Figure  3.4(c).  For  comparison,  mis  circuit  can  be  transformed  into  one 
equivalent  to  Figure  3.4(b).  as  shown  in  Figure  3.4(d).  Although  our  assumption  is  crude,  the 
error  is  still  tolerable 

3.2.4.  Highly  irregular  shapes  of  conductors 

If  a  small  portion  of  a  highly  irregular  shaped  conductor  requires  the  high  accuracy  of  the 
resistance  calculation,  the  only  feasible  method  is  to  solve  the  2-dimensional  Laplace  equation 
by  the  numerical  program  FEMRC  described  in  Chapter  2.  At  present  this  program  is  not 
included  in  our  extractor.  The  designer  has  to  interactively  simulate  the  critical  part  of  the 
layout,  then  insert  the  extracted  resistance  network  back  to  the  circuit  extractor  output.  The 
combined  circuit  description  is  then  ready  for  the  circuit  simulation.  Due  to  the  required  user 
interaction  and  the  fairly  heavy  computation  by  FFMRC.  this  approach  might  not  be  very 
practical 

FFMRC  is  also  able  to  simulate  two-dimensional  capacitances  by  adopting  that  option  in 
running  the  program.  Therefore.  FFMRC  can  also  be  used  to  verify  the  2-d  analytical  capaci¬ 
tance  formulas  which  will  be  discussed  in  Sec  ion  3.3. 
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f  igure  3.4  A  resistance  calculation  example:  (a)  A  conductor  layout 
(b)  An  equivalent  resistance  network  calculated  by  our  method, 
(c)  A  resistance  network  calculated  by  FEMRC.  and  (d)  An 
equivalent  network  for  the  network  shown  in  (c). 


VVW*!1 


M  i»  *j.i  1 1,1  n  ii  « ij 


. V.  V  V  VZ.  V  XT't*x.wVwv  »V*V  »V»V*r»  w;  ■■'V 


42 


3.2.5.  Contact  resistance 


As  MOS  technology  scales  down  into  the  submicron  regime,  the  series  resistances  contri¬ 
buted  by  the  source-drain  and  polysilicon  contacts  may  become  important.  The  contact  resis¬ 
tance  R  is  defined  as  the  resistance  between  two  different  materials  in  the  contact  region.  The 
most  commonly  used  contact  resistance  is  computed  by  the  following  equation. 


L 

R  =  - : - - 

W  tanh(  I,  //.  ) 

where  L  is  the  length  of  window  contact.  VV  is  the  width  of  window  contact  which  is 

dicular  to  the  current  flow,  and  L  is  the  transfer  length  and  has  the  following  form: 

1/3 


P, 


(3  4) 


perpen- 


(3.5) 


where  p  is  the  specific  contact  resistivity,  the  physical  parameter  which  governs  the  interfacial 
contact  resistance  between  two  different  materials. 


liquation  (3.4)  is  derived  from  the  one-dimensional  model  and  is  usually  used  in  a 
transmission  line  tap  test  structure  [30]  to  determine  p  .  However,  in  this  model  it  is  assumed 
that  the  specific  contact  resistivities  for  diffusion-metal  and  polysilicon-metal  contacts  are 
already  given.  If  the  direction  of  the  current  flow  is  known.  Hquation  (3.4)  then  can  be  applied 
to  estimate  the  contact  resistance  in  the  circuit  extraction.  Unfortunately,  this  information 
will  not  be  known  until  the  simulation  is  performed.  Therefore,  the  lollowing  zero-dimensional 
analytic  equation  is  adopted  to  compute  the  contact  resistance. 


P. 

R  =  — 
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(3.61 


where  A  is  the  contact  area.  This  equation  assumes  that  the  potentials  at  the  contacted  materi¬ 
als  are  constant,  and  the  current  density  entering  the  contact  window  is  uniform.  It  can  be 
show  n  that  Hquation  ( 3.6  1  is  a  limiting  case  of  Hquation  ( 3.4 )  w  hen  I.  » 1. .  If  a  more  accurate 


estimate  lor  the  contact  resistance  is  desirable,  the  numerical  mode!  embedded  in  HHMRC  or 


even  a  three-dimensional  model  should  be  employed.  However,  in  our  extractor  only  Equation 
(36)  is  applied. 

3.3.  Capacitor  Models  Used  in  the  Extractor 

In  calculating  capacitance  of  the  interconnects,  basically,  two  types  of  capacitances  are 
considered:  the  self-capacitance  (line-to-ground  capacitance)  and  the  coupling  capacitance.  Cou¬ 
pling  capacitances  can  be  further  categori2ed  into  two  types:  (1)  the  parallel-edge  capacitance 
lor  two  conductors  on  the  same  mask,  and  (2)  the  overlap  capacitance  for  two  conductors  on 
two  different  masks. 

3.3.1.  Empirical  formulas  for  line-to-ground  capacitances 

To  accurately  predict  the  VLSI  performance  during  the  preliminary  circuit  design  phase, 
accurate  estimation  of  parasitic  capacitances  associated  with  interconnects  is  inevitable.  In  this 
section,  we  will  concentrate  on  the  derivation  of  one  of  the  most  important  parasitic  capaci¬ 
tances.  i.e  .  line-to-substrate  capacitance  or  self-capacitance.  Although,  generally,  two-  or 
three-dimensional  numerical  calculations  using  Green’s  function  [37]  or  the  finite-element 
method  as  described  in  Chapter  2  are  able  to  predict  the  line-to-substrate  capacitance  of  a  con¬ 
ductor  in  a  homogeneous  medium  or  on  several  layers  of  the  media,  they  are  prohibitively 
expensive  to  be  incorporated  into  CAD  programs.  A  simple,  accurate  and  closed-form  expres¬ 
sion  to  approximate  the  line-to-substrate  capacitances  is  thus  necessary  and  critical  in  this 
situation. 

Several  closed-form  formulations  of  a  single  conductor  in  a  homogeneous  medium  have 
been  proposed  [3S-40],  However,  they  are  either  computationally  complicated  or  not  always 
accurate.  In  this  section,  a  simple  and  accurate  formula  lor  the  line-to-substrate  capacitance  of 
a  single  conductor  in  a  homogeneous  medium  is  described.  In  the  formulation  the  two- 


dimensional  effects,  e.g..  fringing  fields  and  the  thickness  of  a  conductor,  are  taken  into  account, 
f  urthermore,  an  attempt  is  made  to  include  the  effect  of  the  slant  sidewalls  of  a  conductor  into 
the  formulation.  To  confirm  the  accuracy  of  the  reported  formula,  programs  such  as  CAP2D  or 
I  EMRC  are  used  to  generate  the  reference  values.  To  evaluate  the  slant-sidewall  effect  on  the 
reported  formula,  examples  with  various  combinations  of  design  parameters  are  examined, 
f  inally,  a  simple  modification  of  the  approximate  formula  suitable  for  two  layers  of  media  is 
discussed. 

3.3. 1.1.  Formulation 

The  capacitance  of  a  conductor  over  a  substrate  shown  in  1'igure  3.5  is  determined  by  the 
thickness  t  ,  the  width  w  of  a  conductor,  the  distance  h  from  the  ground  plane  (the  substrate 
usually  taken  as  the  ground),  and  the  angle  a  of  the  slant  sidewalls.  Intuitively,  this  line-to- 
substrate  capacitance  is  composed  of  two  parts,  one  with  i  =0  and  the  other  resulting  from  the 
sidewalls  of  a  conductor.  If  the  capacitance  with  c  =0  and  the  capacitance  due  to  the  sidewalls 
of  a  conductor  are  correctly  modeled,  then  the  line-lo-substrale  capacitance  is  simply  given  by 
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C  =  C,  +  C,  (3.7) 

where  C,  is  the  capacitance  of  infinitesimally  thin  parallel  plates  (r  =0).  including  the  f  ringing 

field  ellect.  and  C\  is  the  capacitance  due  to  the  sidewalls  of  a  conductor 

lor  the  capacitance  ot  the  very  thin  parallel  plates  (i  =0).  including  the  fringing  field 
effect,  the  following  formula  is  first  considered  [41] 
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The  first  term  represents  the  parallel-plate  capacitance  while  the  second  term  is  the  fringing- 
field  capacitance.  However,  this  formula  is  only  valid  when  the  width  w  is  much  greater  than 
the  distance  h  from  the  ground  plane.  The  errors  are  more  pronounced  when  the  width  w  is 
comparable  to  the  distance  h  .  the  usual  case  for  VLSI  circuits.  Hence,  other  formulas  which 
have  been  developed  for  the  microstrip  using  a  modified  conformal  transformation  method  (for 
t  —(I)  are  employed  [42]. 
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The  fringing-fieid  capacitance  C,  of  a  single  conductor  with  t  =0  is  given  by 
C,  — C 


= 


(3.11) 


he  capacitance  C ,  is  the  parallel-plate  capacitance  given  by  the  following  simple  equation: 


C,  =  e 


(3.12: 


As  for  the  sidewall  capacitance  of  a  single  conductor,  the  following  formula  which  models 
:  jco  v.t.nducl:ng  plates  at  a  certain  angle  is  used  [4.3] 
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where  lx.l-,  and  6  are  the  lengths  and  the  angle  as  shown  in  Figure  3.6.  Since  the  f ringing-field 
effects  at  the  ends  are  neglected  in  the  above  formula,  the  computed  capacitance  always 
underestimates  the  actual  one  But  this  underestimation  will  be  compensated  by  the  capacitance 
of  thin  parallel  plates  (t=0)  mentioned  when  summing  up  the  total  line-to-substrate  capaci¬ 
tance.  Therefore,  the  total  line-to-substrate  capacitance  per  unit  length  of  a  single  conductor  is 
given  by 


C  =  e— -+e  1 .393-F0.667Zn  — +  1.444  H - - — In  1  +  —  .  w 

h  h  n-a  i , 


w 

C  =  e — +€ 


(3.14) 


2v 

IV 

&h 

w  h 

In  — -  + 

— 

vv 

Ah 

(3.15) 


It  is  noted  that  the  factor  2  in  the  sidewall  capacitance  term  accounts  for  the  two  sidewalls  of  a 
single  conductor.  Also  the  capacitance  is  reduced  to  the  capacitance  of  thin  parallel  plates  C. 
when  1=0.  in  accordance  with  the  physical  interpretation. 


Figure  3.6  Two  plates  with  a  certain  angle 
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3.3. 1.2.  Comparison  and  discussion 

In  order  to  compare  the  accuracy  of  the  various  proposed  approximations,  capacitance 
values  obtained  by  two-dimensional  numerical  computations  using  Green  s  t  unction  were 
chosen  as  the  reference  values.  Capacitances  normalized  with  respect  to  the  parallel-plate  capa 
citance  obtained  by  numerical  calculations  and  our  approximation  with  a  =  90°  are  depicted  in 
f  igure  3.7  for  t  /h  =0  1.  1 .  and  10.  respectively.  For  t  Ih  =  0  1  and  1  our  approximated  values 
are  within  3 ft  of  the  reference  values  except  for  w/h  «  1  This  is  because  the  formula  for  the 
capacitance  of  thin  parallel  plates  used  in  our  approximation  is  highlv  inaccurate  when 
w/h  <<  1.  In  VI. SI  circuits,  however,  this  is  seldom  the  case.  For  t  Ih  =  1 0  the  observ ed  errors 
are  w  ithin  b'v  of  the  reference  values  again  except  for  w/h  «  1  for  the  same  reason  as  before 

In  Figures  3  H.  3.9  and  3  10.  the  percentage  error  with  a  =  90°  is  plotted  for  r  7i  =  0.1 
I  :h  =  1  and  t  ,'h  =  10.  respectively .  for  Flmasry  s  approximation  [39],  the  cylindrical  approxi¬ 
mation  [40],  and  our  approximation  These  figures  show  that  over  a  wide  range  of  design 
parameters,  i.e..  the  thickness  and  the  width  of  a  conductor  or  the  distance  of  a  conductor  from 
the  substrate,  the  results  seem  to  be  much  better  when  our  formulas  are  used  to  compute  the 
capacitance  of  a  conductor  w  ith  the  sidewall  angle  a  =  90  In  order  to  study  the  capac  itance  ol 
a  conductor  with  slant  sidewalls  a  two-dimensional  numerical  program  which  includes  the 
slant  sidewall  effect  was  used  to  generate  the  reference  values  Fable  3  1  lists  the  percentage 
errors  lor  conductors  with  a  sidewall  angle  a  =  7()T  and  the  results  show  a  general  agreement 
bet  ween  the  approximate  values  and  the  reference  values  Also  indicated  in  I  able  i  1  are  the 
percentage  errors  between  calculations  neglecting  the  slant -sidewall  effect  in  our  approximate 
formulas  'o  -  90  )  and  calculations  hv  the  two  dimensional  program  w  ith  o  ^  'o  I  he  oim 
par, sens  show  that  except  for  f  h  =  O  1  the  percentage  error  is  greater  I  >r  ur  approximate  lor 
mulas  where  the  sidewall  angle  n  is  not  taken  .nto  consuler.il  ion  !  here!  -re  ,r.  order  mam 
tain  '  he  accur.ic  .  of  >ur  proposed  formulas,  it  :s  ir.p'r’ar.t  '  Mke  •  ne  stan:  aifewaii  etled  .nt. 
account 
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Figure  3.8  Comparison  of  percentage  errors  for  t/h  =  0.1. 


Table  3.1  Percentage  errors  of  our  sell -capacitance  formulas 
for  a  conductor  with  a  sidewall  angle  a  =  70 


Percentage  Error  (a=70u) 

w/h 

t/h  =  10 

t/h  =  5 

t/h  =  3 

t/h  =  1 

t/h  =  0.1 

1 

- 

- 

- 

0.33 

(4.67)* 

-1.82 

(-1.11) 

2 

- 

- 

- 

0.97 

(4.24) 

-1.34 

(-0.83) 

5 

- 

6.48 

(11.14) 

4.73 

(8.48) 

1.70 

(3.65) 

-0.25 

(0.08) 

10 

6.92 

(10.71) 

5.12 

(8.02) 

3.88 

(6.18) 

1.63 

(2.81) 

0.29 

(0.45) 

* percentage  error  for  a=90°. 


In  the  previous  discussion  we  have  assumed  a  single  dielectric.  In  an  actual  configuration, 
however,  the  conductor  is  situated  on  top  of  the  oxide  (Si02)  and  capped  with  a  passivation 
layer,  e.g..  Si\  as  shown  in  Figure  3.11.  The  difference  between  dielectric  constants  should  also 
be  taken  into  account  in  the  approximate  formulas  to  maintain  accuracy.  By  using  the  similar 
treatment  as  [ 40] .  weighting  factors  for  the  "parallel-plate"  term  and  the  "fringing"  term  are 
chosen.  For  two  layers  of  dielectric  media  the  line-to-substrate  capacitance  is  thus  given  by 


Figure  3.11  A  conductor  surrounded  bv  two  different  dielectrics. 
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3.3.2.  Coupling  capacitance 

We  shall  now  turn  to  the  coupling  capacitance  calculation.  For  the  case  as  shown  in  Figure 
3.12.  the  following  simple  formula  for  parallel-edge  capacitance  [35]  is  used. 

0.222.  -1  34 
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where  w  is  the  width  of  the  conductor,  t  is  the  thickness  of  the  conductor,  h  is  the  height  of 
the  conductor-to-ground  plane  and  s  is  the  separation  between  two  conductors.  The  percentage 
error  of  the  above  formula  is  within  10%  of  the  referenced  value  resulting  from  2-d  calcula¬ 
tion. 


As  for  the  case  in  which  two  parallel  conductors  are  on  different  mask  levels  as  shown  in 
Figure  3.13.  the  following  formula  for  the  overlap  capacitance  derived  in  [27]  is  used. 
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Figure  3.12  Two  conductors  at  the  same  layer  over  ground  plane. 
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Figure  3.13  Two  conductors  with  overlapping. 
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where  y  =  - .  /3  =  - .  and  €  =  (€ t ~F€2 )/2 ■  This  formula  accounts  for  different  dielectric 
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constants  by  using  a  weighting  factor. 

Finally,  the  formula  for  cross-over  coupling  capacitance  as  derived  in  [27]  is  employed  in 
our  extractor,  which  is  given  by 
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This  formula  is  obtained  by  fitting  data  calculated  by  a  3-dimensional  numerical  program.  In 
order  to  reduce  variables  in  curve  fitting,  two  assumptions  are  made.  First,  thickness  and 
widths  are  taken  as  the  same  for  two  conductors.  Secondly,  the  length  is  five  times  that  of  the 
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width.  This  assumption  is  a  reasonable  one  because  most  of  the  fringing  field  is  close  to  the 
overlapped  region  between  two  conductors.  The  nominal  percentage  error  is  1  Off  as  compared 
to  the  numerical  calculation.  However.  if  the  thickness  and  width  of  both  conductors  are 
different,  this  percentage  error  might  increase. 

3.4.  Lumped  RC  Circuits  for  Interconnect  Modeling 

Once  resistance  and  capacitance  values  are  determined,  a  lumped  circuit  model  is  used  to 
approximate  the  behavior  of  distributed  RC  lines.  In  [44],  it  has  been  concluded  that  7r-circuit 
models  give  satisfactory  results  in  the  approximation  of  a  distributed  RC  line  Therefore,  a  tt- 
circuit  model  is  chosen  to  approximate  every  branch  in  the  interconnect.  Furthermore,  in  order 
to  compromise  between  the  number  of  nodes  created  and  the  accuracy  of  modeling,  a  resistance 
threshold  /?_  is  used  to  select  a  one-  or  a  two-section  rr-lumped  circuit  in  modeling  distributed 
interconnects.  However  for  the  insertion  of  coupling  capacitance,  a  T-like  circuit  model  is 
chosen  for  simplicity.  Figure  3.14  gives  an  example  showing  how  we  use  the  lumped  circuit 
model  to  approximate  distributed  RC  lines  which  include  the  coupling  capacitance. 

3-5.  Conclusions 

In  this  chapter,  the  resistance  and  capacitance  computation  models  used  in  HPEX  are 
developed  In  modeling  an  interconnection  line,  analytical  formulas  obtained  by  fitting  the 
numerical  data  are  first  employed  to  compute  the  resistance  and  capacitance  values.  These 
values  are  then  distributed  into  two  sections  of  a  lumped  circuit.  Analytical  formulas,  instead 
of  more  accurate  numerical  methods,  are  adopted  in  HPFX  to  significantly  reduce  the  computa¬ 
tion  time.  By  carefully  fitting  the  analytical  formulas,  the  produced  errors  can  be  controlled 
within  the  tolerance  limit. 


The  same  filling  technique  can  be  applied  to  derive  the  closed-form  formulas  for  coupling 
capacitance  calculation.  The  accuracy  of  the  fitting  formulas  are  somewhat  reduced  because 
most  of  the  coupling  capacitances  are  three-dimensional  in  nature  Furthermore,  some  accuracy 
mac  be  lost  due  to  the  fact  that  self-  and  coupling  capacitances  are  separately  computed  in 
these  models.  In  reality,  both  self-  and  coupling  capacitances  are  strongly  coupled  together  and 
cannot  be  separated.  However,  these  computation  models  are  able  to  reduce  the  computer 
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resources  bv  predicting  the  first-order  parasitic  effects.  If  a  higher-order  accuracy  is  required,  a 
numerical  program  such  as  f  IMRC  should  be  included  in  HPEX.  But  for  real  circuits  with  a 
large  number  of  parasitics.  the  incorporation  of  FEMRC  into  HPEX  may  not  be  practical. 
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CHAPTER  4. 


DATA  STRUCTURE  AND  BASIC  ALGORITHMS  IN  HPEX 


4.1.  Technology  Assumptions 


Technology  assumptions  embedded  in  HPEX  can  be  described  as  follows: 

( 1 )  MOS  technology  :  Either  N'MOS  or  CMOS  circuit  layouts  are  assumed. 

(2)  Manhattan  style  layout  :  Only  isotheiic  (or  rectilinear)  polygons  are  allowed  as  input  to 
HPEX  All  boundary  segments  of  isothetic  polygons  are  parallel  to  .v-axis  or  y-axis. 
furthermore,  all  isothetic  polygons  are  split  into  a  equivalent  set  of  rectangles  stored  in 
the  internal  data  structure  of  HPEX 

(3)  No  closed  loop  transistors:  Only  simple  or  serpentine  shape  transistors  can  be  specified  in 
the  input  file 

(4)  One-  or  two-layer  metal  process  No  contact  cuts  are  allowed  between  the  second  layer 
metal  and  polysilicon  or  diffusion  layer  I  he  second  layer  metal  is  electrically  connected 
to  the  polysilicon  or  diffusion  conductors  only  through  the  first  layer  metal. 

(5)  Planar  interconnect  process  .  There  is  no  step  coverage  in  the  interconnect  conductors. 
Although  this  assumption  simplifies  resistance  and  capacitance  calculation,  it  is  somew'hat 
invalid  because  most  of  present  semiconductor  technologies  still  use  the  conventional 
interconnect  process  with  step  cov erages  The  planar  interconnect  technology  will  substan¬ 
tially  complicate  the  fabrication  process  However,  the  planar  interconnect  process  will 
fiecome  the  choice  of  technology  due  to  the  requirement  for  multi-level  interconnects  and 
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high  packing  density  in  VLSI  circuits.  Some  of  advanced  GaAs  circuit  technologies  have 
already  demonstrated  the  advantages  of  using  the  planar  interconnect  technology. 


4.2.  Geometrical  Data  Structure  and  Algorithms 


In  HPEX.  a  layout  is  represented  as  a  collection  of  rectangles.  All  geometric  manipula¬ 
tions.  such  as  boolean  mask  operations,  are  based  on  rectangles  In  general,  a  large  number  of 
rectangles  are  needed  to  describe  the  layout  of  a  typical  VLSI  circuit.  Therefore,  from  the 
extractor  performance  point  of  view,  to  design  an  efficient  rectangle  data  structure  for  HPEX  is 
very  crucial.  A  linked  list  of  rectangles,  though  simple  to  implement,  is  not  efficient  enough  for 
region  queries.  Region  queries,  frequently  used  in  HPEX.  are  defined  as  operations  of  finding  all 
rectangles  intersecting  a  specified  region.  The  geometrical  data  structure  employed  in  HPEX  is 
called  a  4-d  binary  search  tree  [45-46].  The  basic  concept  behind  a  4-d  binary  search  tree  is 
based  on  a  divide-and-conquer  technique.  Each  node  of  the  tree  corresponds  to  a  rectangle  in 
the  layout.  At  the  root  of  the  4-d  binary  tree,  the  space  is  split  in  half  by  the  left  edge  of  the 
rectangle  corresponding  to  the  root  node.  At  the  next  level,  the  remaining  space  is  cut  in  halt  b  . 
the  bottom  edge  of  corresponding  rectangles.  Bv  a  similar  procedure  the  right  edge  and  then  'he 
top  edge  is  used  to  divide  space  in  half,  figure  4  1  shows  a  layout  example  consisting  o!  1  1  rev. 
tangles  and  Us  corresponding  4-d  binarv  tree. 


The  Pascal  data  structure  for  4-d  binarv  trees  used  in  HP!  X  i-  defined  as  tollo-x' 


rectptr  =  rect: 


red 


=  record 

key  :  array[0.  3]  of  real. 

(*  four  kevs 
key[0)  =  xmin 
key  [  1  ]  =  y min 
key[2)  =  -xmax 
key] 3]  =  -  .  max  * ) 

mask  mask  level.  (*  Mask  le.e:  red  a • 
Ison.  ( *  let  t  son  in  tree  strut  t  ure  *  1 
rson  rectptr;  I*  rtght  son  ;n  tree  •  irud 
ndisc  :  integer;  <*  discriminator  ,  r 
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in  the  input  phase  of  HPEX.  4-d  binary  search  trees  are  first  built  according  to  each  mask 

lecel.  Then  various  searches  are  performed  based  on  these  trees,  either  identifying  transistor 

channels  or  finding  all  electrically  connected  rectangles.  In  general,  two  basic  algorithms  are 

needed,  insertion  and  intersection  queries.  An  insertion  algorithm  is  used  to  build  as  well  as 

update  4-d  binary  search  trees,  while  an  intersection  search  algorithm  is  to  find  all  rectangles  in 

a  tree  which  are  intersected  by  a  given  region  The  insertion  algorithm  used  in  HPTX  is 

described  as  follows. 

ALGORITHM  4.1  4DINSERT 

Input  a  node  P  and  a  4-d  binary  search  tree. 

Output  a  new  4-d  binary  search  tree  with  node  P  inserted. 

11  [Check  for  emptv  tree] 

It  ROOT  =  ml  then  set  ROOT  =  P.  DISC(P)  =  0. 

rsont  P)  =  ml.  Ison(P)  .=  nil.  set  rson(P)  as  thread  and  return 

(  Hherw  ise  Q  =  ROOT 

12  [Compare] 

l!  ke\ ,  [t  ]  =  ke\\ , [i  ]  for  0  <=  i  <=  3 

1  i  e  the  nodes  are  equal)  then  return;  otherwise. 

if 

(  1  )  key  AD  /  SC  (Q  )]  >  kex^D/SCiQ  )]; 
or  i  2  )  key,  [DISCiQ  )]  =  kcy^[D/SC(Q  )]; 

key  A( D/SC (Ql+Dmod^)  >  keyJ(DJSC(Q  )+DmoJ  4]; 
or  (  3  '  key,  [D/SC(Q  )]  =  key^DlSCAQ  )]; 

key ;,[( DISC  <Q  )+l)m<></4]  =  key<t[U)lSClQ  )+l  )mod  4]; 
ke\)\(DlSC{Q)+2)mocn]  >  key^ [ ( DISC ( (2  ) + 2 )rm x/ 4 ] . 
or  (41  key,  [DISC(Q  )]  =  key^lD/SCiQ  )]; 

keyp[(D/SC(Q  )+l  )mod  4]  =  kcx^iDISC  (Q  )+l  )nu>d  4]; 
key;,[(DlSC  IQ  )+2  )mod  4]  =  keyJw/SC  (Q  )+2  )mod  4], 
keyP[(DISC(Q  )+i)nwd  4]  >  key^W/SCiQ  )+3)rW  4]; 
then  set  son(y)  :*=  rson(Q) 
else  set  son(Q)  :  =  Ison(Q). 

If  son(Q)  =  nil.  then  go  to  14 

13  (Move  down] 

Set  0  :=  son(Q)  and  go  to  12. 

14  [Insert  new  node  in  tree] 

Set  son1  y  1  :=  P.  rson(  P)  :=  ml.  lson(  P)  -  ml  and  ret  urn 

In  general,  a  i>uilt  tree  is  not  balanced  in  height  if  a  random  order  of  rectangles  is  used  as 
an  input  Bv  employing  the  above  algorithm,  f  igure  4  2  illustrates  a  4-d  hinarv  search  tree 
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corresponding  to  the  layout  shown  in  Figure  4.1.  This  tree  structure  is  different  from  that 
shown  in  Figure  4.1  because  in  the  actual  implementation  key[ 2]  and  *<?}'[3]  are  defined  as  nega¬ 
tive  of  the  right  and  upper  bounds  of  a  rectangle.  It  should  be  noted  that  lor  applying  the 
above  algorithm  a  different  insertion  order  of  rectangles  may  produce  a  different  tree  structure. 
As  an  example.  Figure  4.3  shows  a  different  4-d  binary  tree  corresponding  to  the  same  layout 
described  in  Figure  4.2.  Since  a  built  tree  structure  varies  with  the  input  sequence  of  rectan¬ 
gles.  we  may  have  a  purely  directed  path  as  the  worst  case  in  applying  4DINSERT. 

Another  algorithm  to  build  a  4-d  binary  search  tree  attempts  to  balance  the  height  of  the 
tree  by  some  preprocessing  steps  at  each  level  of  the  tree  structure.  This  alternative  first  reads 


in  all  rectangles  and  stores  them  in  a  linked  list.  This  linked  list  is  then  recursively  sorted  In 
the  respective  key.  and  broken  into  approximately  two  halves  bv  a  median  node,  (liven  a 
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ALGORITHM  4.2  4DBI  ILCX  A,j,P> 


Inpul.  j  linked  list  ol  nodes  A.  discriminator  j 
and  root  node  I’ 

()uiput  a  balanced  tree  rooted  at  (' 

III  [Check  lorempts  sell 
II  A  is  nil  then  ret  urn 

H2  !  I  i  nd  nteil  ian  ] 

Set  I'  -  i  median  node  ol  A 

II  I  [Split  set  J 

Set  \  I  iO  (•  \  o  ;  less  th.ir'  I" 
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4.3.  Circuit  Netlist  Data  Structure 


4.3.1.  MOS  transistor  record 

Since  our  circuit  extraction  is  oriented  to  the  MOS  technology,  the  record  for  MOS  transis¬ 
tors  must  be  defined  in  order  to  store  the  extracted  information  about  the  transistors.  The 
transistor  record  is  defined  as  follows: 
tranptr  =  "transistor: 
transistor  =  record 

ttype  :  nmostype:  (*  type  of  the  transistor  *) 
w  idth. length  :  real:  (*  width  and  length  of  the  transistor  *) 
source. drain  :  adjptr: 
gate  pdaptr: 

dnode.snode.gnode  :  integer:  (*  node  numbers  for  drain,  source  and  gate  *) 
gcap  real:  (’  WLC„,  *) 

dcap  real.  (*  capacitance  associated  with  drain  node  *) 
scap  real:  (*  capacitance  associated  with  source  node  *) 
drainnet.sourcenet.gatenet  .  netptr: 
next  :  tranptr. 
end: 

In  the  above  record,  most  ol  the  parameters  are  self-explanatory  The  source,  drain  and  gate 
pointers  are  heads  ot  lists  of  rectangles  corresponding  to  the  source,  drain  and  gate  regions, 
respective!;,  This  information  is  eery  important  when  we  estimate  the  equiv  alent  channel 
length  a  id t h  source  and  drain  areas  of  the  extracted  MOS  devices.  In  order  to  easily  letch 
'hree  lerminal  net'  .>1  a  particular  transistor,  sourecnet  .  drainnet  and  gaienet  are  defined  as 
[snnters  u,  source  drain  and  gate  nets  respect  iv  el  v  It  is  also  noted  that  the  drain,  source  and 


f 


.'ate  rvde  rumners  are  kept  in  the  integers  drunie  'node  and  gm*/ e.  and  need  to  be  updated 
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netpir  =  net: 
net  =  record 

ntype  .  netlype; 

namenode  .  ionodeptr:  (*  character  name  *) 
totalres.totalcap  :  real: 
tranconset  :  tranlinkptr: 

reduction  :  boolean:  (*  need  node  reduction7  *) 
bound  :  array  [0.  3]  of  real; 
nextnet  :  netpir, 

number  :  integer;  (*  net  number  *) 
geometry  :  gmyptr: 
branch  :  branptr: 

resistor. selfcap  :  crptr:  (*  lists  of  resistors  and  capacitors  *) 
couplingnet  :  cpnetlinkptr;  (*  coupling  nets  *) 
end; 

The  pointer  tranconset  points  to  a  list  of  connected  transistors  by  the  net.  This  information  is 
useful  when  we  need  to  traverse  the  circuit  schematic.  A  list  of  rectangles  which  constitutes 
the  net  is  stored  in  the  geometry  pointer.  These  rectangles  are  processed  through  the  RC  model 
module  to  generate  interconnect  parasitic  resistances  and  capacitances  If  the  net  is  connected  to 
more  than  two  transistors  then  branches  constituting  the  net  are  pointed  by  the  branch 
pointer  Another  parameter  used  is  bound .  which  defines  the  boundary  of  a  net  Before  calculat¬ 
ing  the  coupling  capacitance  between  two  nets,  bound  is  used  to  determine  whether  the  coupling 
is  too  smal’.  to  calculate  This  avoids  computation  of  the  coupling  capacitances  which  have  a 
negligible  effect  on  circuit  performance. 

The  basic  unit  in  interconnect  RC  modeling  for  HPF:\  is  called  branch.  A  branch  is 
defined  as  a  set  of  electrically  connected  rectangles,  in  which  there  is  no  side  branch  lor  current 
flo  a  1  or  each  branch  two  resistors  and  three  capacitors  are  used  to  approximate  an  RC  distri¬ 
buted  line  The  branch  record  in  HIM  \  is  given  as  follows, 
branptr  -  branch 

brunch  -  record 

bound  arrav  [<)  H  of  real. 

nodel  node2.nodem  integer  ( *  end  and  middle  node  nundiers  *  1 

resistor  I  resistor2  crptr 

noile Kap  node2cap.nodemcap  crptr 

k>m’.  connect  udiptr 

coupcap  vpconptr 
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next  :  branptr; 
end; 

The  array  bound  also  represents'  the  bounding  box  of  the  branch.  In  coupling  capacitance  calcu¬ 
lation.  if  two  nets  are  close  enough,  then  the  bounding  boxes  of  two  branches  from  each  net  are 
compared  to  filter  out  the  coupling  effect  due  to  a  large  distance.  This  process  proceeds  until  all 
branches  have  been  compared.  This  hierarchical  comparison  for  the  bounding  boxes  of  nets  and 
branches  will  substantially  reduce  the  number  of  rectangles  required  for  comparison  during 
coupling  capacitance  calculation. 

4.4.  Layout  Resizing  Algorithms 

Because  of  the  important  role  played  by  chip  layout  feature  sizes  in  circuit  performance, 
actual  dimensions  of  devices  and  interconnects  should  be  correctly  estimated.  If  the  CIF  input 
represents  a  mask  layout,  then  the  difference  between  the  mask  lav  out  and  the  actual  dimen- 
Mon  on  the  physical  layout  resulting  from  process  variations  should  be  taken  into  account  dur¬ 
ing  circuit  extraction.  The  fabricated  regions  are  usually  different  from  the  regions  ir.  the  mask 
lav  out  by  a  uniform  width  around  the  periphery  of  the  regions.  For  example,  the  diffusion 
regions  are  larger  than  the  mask  layout  due  to  the  inherent  lateral  diffusion  ol  ions  during  the 
lubrication  process.  Nevertheless,  the  polysilicon  regions  might  be  smaller  than  the  designed 
mask  because  ol  the  process  bias.  This  width  of  contraction  or  expansion  becomes  significant  in 
determining  the  critical  leature  size  ol  the  circuit  layout  as  the  VLSI  chip  becomes  larger  and 
denser  Thus  a  *.apabihtv  to  resize  the  regions  of  a  VI  SI  layout  before  circuit  extraction  is 
important  tor  the  la  .  out  verification 

<n.en  the  regions  .  ov ered  bv  a  set  ol  rectangles,  several  approaches  have  been  proposed  to 
,  r’r.ui  <r  to  -\pund  t  hese  regions.  Tv  pica Ilv  .  a  lavoul  resizing  procedure  can  he  described  as 
id  c  -  1  r.t  ’he  u’.'ir.es  ' ,  •  r  boundaries  ’  ,<i  ;  he  disjoint  regions  are  detec  ted  1  hen  the  true 
"  uiut.i:  es  it  ■  hi,,  resized  •  eg i  m-  could  lie  t  rmed  h.  ..  ntrading  for  expanding1  the  boundurv 


segments  inwardly  (or  outwardly)  through  parallel  displacements.  If  the  final  output  is 
represented  by  an  equivalent  set  of  rectangles,  a  decomposition  algorithm  which  transforms 
polygons  into  rectangles  is  required  Therefore,  two  different  data  structures  are  necessary  to 
support  the  resizing  algorithm:  (1)  a  rectangle  data  structure  and  (2)  a  polygon  data  s”-ucture. 
An  algorithm  employed  to  maintain  these  data  structures  during  structural  transformation 
could  be  complicated  and  difficult  to  implement.  Hence  a  simple  and  efficient  resizing  algorithm 
based  only  on  one  rectangle  data  structure  is  proposed  to  contract  or  expand  the  input  circuit 
layout.  This  resizing  algorithm  is  implemented  in  the  extractor  HPEX. 

In  the  remaining  part  of  this  section,  important  definitions  will  first  be  given  in  order  to 
describe  the  layout  resizing  problem.  Some  of  the  previous  approaches  will  then  be  reviewed, 
followed  by  a  discussion  of  our  proposed  resizing  algorithm  finally,  other  potential  applica¬ 
tions  of  the  proposed  algorithm  will  be  described 

4.4.1.  Definitions 

Since  a  rectangle  is  chosen  as  a  primitive  in  our  internal  data  structure,  the  following 
definitions  are  mainly  aimed  at  the  relationships  between  rectangles 
Definit ton  4  1  lM>rhetic  Rectangle 

An  ;  wt hetic  rectangle  is  a  rectangle  with  its  edges  parallel  to  eil.'ier  ,x  -  axis  nr  y  -axis  In 
the  remaining  part  of  this  chapter,  the  term  rectangle  represents  isothetic  rectangle  for  brev - 
1 1 ;. 

f  )e!m:tu<n  4  2  (  h  etlap 

I  x  '  rectangles  are  .  er lapped  if  thev  have  a  common  area 
/ ‘eft >n  1  '  \t’u:  in  tie  '  '.a  .  1  Direct  i- -ft 

i  a  rectancies  are  abutted  in  the  >  1  ,-r  .  il.iAt,.  n  if  »i:ev  :.a  .  e  a  c-ntm  n  ’  ;v  men" 


Definition  4  4  Equivalence 


A  set  ol  rectangles  R,  is  equivalent  to  another  set  ol  rev  tangles  R  it  thee  vo.er  '.tie  same 
regions 

Definition  4  5  Ex  pension  lor  Positive  (Iff  setting  I  hy  Width  J 

1  el  S  be  a  rectangular  region.  The  expanded  region  ol  S  hy  w  kith  d  ,s  defined  as  loiiow  - 

EjS  =  {a  €IT|v  6S.  la— i  I  <u  !  41 

whereE;  is  an  expansion  operator  w  Inch  expands  a  region  In  w  idth  d 

f  or  example.  Figure  4.5  shows  a  rectangle  (solid  line'  and  its  expanded  region  d.-"ed 
line)  It  can  lie  observed  that,  by  applv  ing  the  above  definition  the  expanded  :eg  m  have  ■ 
cular  arcs  ol  radius  d  which  inevitable  complicate  the  resizing  algorithm  and  s.ipp,  :  •  n. 
data  structure  As  a  result,  in  order  to  simplil  .  the  resizing  aiz  rthm  and  data  ■'•..ware  n- 
following  loose  definition  for  expansion  ol  a  rectangle  is  used  here.d  :>•' 


V  « 


Di’tinLiion  4  b  Pat  allei  /:  \ rKinsion  t>\  Width  d 


1  et  S  =  !u  .v  ) 6 R'|  \  ,  ^  v  ,  and  y  ,  ^  y  ^ \  , !  be  a  rectangle  in  the  plane  R"  The  paral¬ 

lel  expanded  region  ol  S  by  width  d  is  defined  as  follows: 

E.S  =  i(.i  .y)€R~|.t,—  d  ^.v  ^x-,+d  and  y,—  d  ^y  ^ y2+d  1  .  (4.2) 

x  here  Ej  is  a  parallel  expansion  operator  that  expands  a  region  outwardly  by  a  parallel  move¬ 
ment  ol  boundary  segments  for  a  distance  d 

\  tgure  4  b  illustrates  a  rectangle  and  Us  parallel  expansion.  According  to  this  definition, 
rectangles  remain  rectangles  after  expansion.  The  same  is  true  for  polygons.  However,  for 
pe  . .  gons  the  parallel  expansion  is  not  as  simple  as  that  for  rectangles,  since  there  might  be  an 
intersection  of  boundarx  segments  after  expansion  as  shown  in  Figure  4.7.  Therefore,  finding 
intersections  lietween  boundary  segments  after  expansion  and  removing  false  boundary  seg¬ 
ment'  become  two  main  steps  in  the  polygon  expansion  algorithm  As  for  rectangles,  we  can 
eas  .  find  ineir  expanded  regions  without  worrying  about  intersections  between  boundary  seg- 
m e " : '  This  is  one  et  the  reasons  that  we  develop  the  resizing  algorithm  based  on  a  rectangle 
da  ’a  't  -ik  t  lire 


( )rigma I  Region 


Parallel 

1  xpanded  Region 


cu  e  4  o  i  a  .  ut  para  .  ie!  expansion 


I  igure  4  7  l.avout  f xpan-i.-r.  *nf.  *  >1 

Definition  4  ~  C". implement  (Operator 

For  arv.  region  R.  a  ^.implement  •  prru!  >r  C  is  «:e'  ned  a--  :  Is 

CR  =  if  x  >  ■  €  K  ‘  1 1 1  ,  (R 

Since  the  contraction  ol  a  rectangle  S  un  *xe  defined  a-  '.he  com  p. err..- -v  '  ’!■** 
complement .  namely  C'  S  =  O  E^.  <  CS  I ..  vv  here  C'.  anil  C'  are  parallel  ..  n:  r,i.  '  n  per  a*  r 
complement  operator  respective  v.  we  a.  ill  not  give  a  separate  definr...  n  a">  i.t  n-c'an.'le 
traction  here  it  follows  immediate!-.  !  rom  the  abi-.e  definitions  that  'he  '  .!•>,*  ing  h, 
operations  a  e  correct 

e;<aUB'  =  e-aUe;b 

C ( A  U  B  1  D  C  j  A  (J  C B  I 

I  he  first  propertv  indicates  that  the  expansion  ol  a  union  ol  the  rectangles  is  equivalent  U 

union  of  the  expansions  ol  each  individual  re. tangles.  From  the  electrical  point  of  view  v'iv 

set  of  rectangles  in  ‘he  same  mask  level,  if  two  rectangles  are  electrical  I connected,  then 


*v 


V.’ 


'0 


I  1  1  On  time  pi  >  x  ('filin' 


•  4  K  1  \  ..  ■  ’  '  i  A'  A  'he  .  >  ;i  i  J  i  .-i<-  pr.  ...edure  is  t<  find  the 

r  1  j  /'  I  .<•'  /  ii.i  .  v  r»si-.i  .1  nr  r  more  disjoint  sonnet  ted 


t  Mr  ut!. nr  1  i  .  :  m-,  !  .n-  mi  k  .vies  s  ••m:  r  uvU'tS  .  rrtital  and  horizon 


’  ’.ind.ird  -I  i;  r  ihn-.  .md  -  detailed  implementation  4  the  outline  procedure  for  \ 
.i  r:  i;  n  >jr  tie  ‘ .  i.ttd  .  n  r  he  :e  \ ;  j  ut  ho  red  1 1 ,  I'reparata  and  Shamo.s  (47)  The  algo 

•*.  ■  ..i-  :  *  ii.ni  r liases  and  uii  or  hriell.  described  as  follows  In  the  first  phase  the  s*.an 

1  ih;  ,e  ..pp>  Med  *- .  hr  ectr.ent  tree  T  ;47J  is  ernpl>o.  ed  to  find  the  set  V  ol  vertical 

'■'-•r  .  •  i.-.r  nt  u  r  l  true  'he  ri  V  h.o  I  wen  obtained  the  seiond  phase  is  to  link  these 

i  r.'n.ei' .  '  ,  means  I  '  ...  .it  a  I  •  e^  men  is  i,-  ;■  rm  the  >ut;me  •’(  / 

'  •  tier  r.e  a  If;  r-.thm  ha s  l'ren  pr  p  sod  iv  Komatsu  and  Su/uKi  [ 4S  J.  The  mam  son 

•:  ’  "etuniJ  ■  ne;r  algorithm  is  h  —  •  l:nd  disjoint  connected  s.’mpwnt.s  <each  component  con 

'  1  a  list  ol  rectangles1  ol  rectangles  allocated  to  sutx.hips  o(  a  ship  area  Then  the  reclan- 

«*••  ’hat  belong  to  me  same  pole  g.  n  are  collected  and  the  outline  segments  are  extracted  B\ 
•■peatim;  this  prosess.  the  outl.ne  procedure  is  carried  out  The  process  of  extracting  the  con- 
'  ur  Irom  a  set  ol  rectangles  is  similar  to  that  mentioned  in  the  first  algorithm  except  that  the 
se;  o|  hi  r:  mtal  edges  instead  of  the  vertical  edges  are  derived  first 


4.4. 2. 2.  Resizing  procedure 

Alter  the  outlines  of  a  given  set  of  rectangles  have  been  delected,  we  can  proceed  to  resize 
these  regions.  This  is  equivalent  to  resizing  a  set  of  polygon  regions.  The  typical  resizing  algo¬ 
rithm  can  be  found  in  [49]  and  is  summarized  as  follows: 


XY  Method 


II]  Scan  the  input  region  vertices  and  sort  them  in  the  ascending  v -coordinate  order.  As 
shown  in  Figure  4.9(a),  the  polygon  is  then  split  into  banded  regions  according  to  this  list 
of  sorted  vertices'. 

[2]  Let  d  be  the  expansion  distance.  For  each  banded  region,  the  distance  from  a  vertical  side 
A  to  its  neighboring  side  B  is  determined.  If  it  is  less  than  2d.  then  the  area  between  A  and 
B  is  added  to  the  figure  and  two  banded  regions  are  merged  horizontally  (Figure  4.9(b)). 

[3]  Fxpand  the  output  regions  from  Step  2  by  d  in  the  x  direction  since  no  intersection  of 
sides  will  occur  (Figure  4.9(c)). 

[4]  Repeat  the  previous  steps  lor  the  y  direction. 

Step  2  can  be  done  by  scanning  the  vertices  from  large  to  small  y  values.  In  each  position  of  the 
scanline.  the  horizontal  test  is  performed  to  find  banded  regions  which  should  be  merged  after 
the  expansion  Also  note  that  this  method  resizes  the  rectilinear  regions  with  x  -axis  and  y-axis 
directions  separately. 

4.4.3.  Our  approach  to  layout  resizing  problem 

For  the  purpose  of  efficient  implementation  of  the  resizing  algorithms,  we  develop  two 
different  algorithms  (expansion  and  contraction)  in  our  approach  instead  of  applying  the  con¬ 
cept  of  region  complement. 

4.4.3. 1.  Expansion  algorithm 

As  mentioned  before  there  is  no  discontinuity  problem  for  the  region  after  a  rectangle- 
bv-rectangle  expansion.  Therefore,  in  the  expansion  problem  we  can  first  expand  everv 


( 


'  fL  tangle  and  then  remoxeall  the  ocerlapped  regions  I  tie  algorithm  *  «■  pi>  T1  s<-  c*  si  r  ,t ,  *-li  1 1  i 
a  .t rd  .irut  ian  he  des..  ■  i  bed  as  t  o i  li >w  s 


ALGORITHM  4.4  Scanline  Method  for  Layout  Lxpansion 

Input  a  list  ot  boxes  B  in  one  mask  lexel  the  shrinkage  distance  </ 
anil  the  feature  si/e  threshold  h 
i  'input  a  list  ol  nonox  erlapping  expanded  boxes  B, 

ivgm 

1  [xpund  exery  box  in  B  by  a  distance  <i 

2  Sort  B  by  top  edge  of  box  in  descending  order 
1  l  se  scanline  sleeping  in  -y  direction  to 

transform  B  into  B,,  which  represents 
a  set  of  maximum  horizontal  strips 
4  So  r  l  B./  bv  right  edge  of  box  in  descending  order 
,s  Set  scanline  at  right  edge  of  layer,  sweep  in  the  »  direction 
to  verticallx  merge  boxes  with  v  direction  feature  si/e  less 
than  predefined  threshold  h  into  contracted  set  B 
(>  return!  B.-  > 

-'nd. 


Step  2  in  the  above  algorithm  is  to  produce  a  unique  layout  representation  with  boxes  adjacent 
to  each  other  in  the  X  direction  It  is  noted  that  in  this  step  the  "scanning  painting  algoriihm 
similar  to  [50]  is  used  to  remove  all  overlaps  for  each  mask  level  in  order  to  correctly  perform 
circuit  extraction  The  purpose  ot  step  5  is  to  construct  the  correct  dimensions  for  all  boxes  to 
be  used  in  the  resistance  model  for  circuit  extraction  f  igure  4.10  shows  an  example  for  step  5 


4.4.3.2.  Contraction  algorithm 

Rectangle-bv-rectangie  contraction  for  the  layout  resizing  might  create  the  electrical 
discontinuity  problem.  Therefore,  traditional  contraction  algorithm  merges  all  boxes  into  dis 
joint  polygons  before  going  to  the  actual  layout  contraction.  However,  in  order  to  simplify  the 
algorithm  and  its  supporting  data  structure,  we  propose  a  contraction  algorithm  which  is  based 
on  a  novel  Y-X  scanline  method  together  with  a  rectangle  data  structure  in  our  extractor  This 


algorithm  is  described  below. 


» 

I 


a  1  (  >r ii^ina i  lav  out 


(hi  Alter  vertical  merge 

figure  4  10  A  vertical  merge  example 


ALGORITHM  4.5  Scanline  Method  for  Layout  Contraction 

Input:  a  list  ol  boxes  B  in  one  mask,  level,  the  shrinkage  distance  J  . 

and  the  feature  size  threshold  h  . 

Output  a  list  of  contracted  boxes  B, 

begin 

1  Sort  B  by  top  edge  of  box  in  descending  order 

2  L  se  scanline  sweeping  in  -v  direction  to 
transform  B  into  B,,  which  represents 

a  set  ol  maximum  horizontal  strips. 

3  Shrink  everv  box  in  B;/  by  distance  d  . 

4  Set  scanline  at  top  position  of  layer  again,  sweep 
layer  in  -y  direction  to  insert  missing  boxes  due  to 
layer  contraction. 

5  Sort  B;/  by  right  edge  of  box  in  descending  order. 

6  Set  scanline  at  right  edge  of  layer,  sweep  in  the  x  direction 
to  vertically  merge  boxes  with  y  direction  feature  size  less 
than  pre-defined  threshold  h  into  contracted  set  B(  . 

7  returnfB,  ). 
end 


It  is  noted  that  shrinking  boxes  is  performed  after  a  set  of  maximum  horizontal  strips  are 
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■  r  ’  .  i  .i  •  .  ;  •  '  .is  ’il  I  lii1  I'x.imple  n  -hi1'*  n  m  I  n’lin-  -4  I  1  m  nu!:  il  i  he  ‘  eat 

"■  ■  -  •'  *.  k.1*  In*  ri-si/irii;  distance  i  lie  jhcvf  algorithm  r  r  diKe  t 

i  In  rder  make  the  algorithm  m«>re  robust  a  nmdilird  algorithm 

.  .irtlire  "nr’ii.  i!  is  ile\  eli ipei!  and  gr.  en  as  lollows 


\I  (iOKI  I  HM  4.6  Modified  Scanline  Method  for  layout  Contraction 

pi.’  ,i  '  s  ees  B  m  one  mask  level,  the  shrinkage  distance  J 

,i  nil  he  t  e.t '  lire  size  i  h  res  ho  Id  h 
h.'pi.;  ,i  i,:  :  ,  ntijiinl  boxes  B, 


i _ _ 


>d>  Incorrect  Contraction 


( b)  Correct  Contraction 


figure  4  1 1  An  example  to  show  incorrect  contraction 
by  scanline  algorithm. 


1  St  rl  B  right  edge  ol  box  m  descending  order 

2  l  se  \,inlini'  ‘-weeping  in  »  direct  inn  u. 

'rjnslorm  B  into  B  i  hii  h  represents 

a  set  <  t  maximum  vertical  strips 
2  Shrink  e>erv  ixsx  in  B  hv  distance  d  in  v 
direc  t  inn 

■4  Sort  B  In  top  edge  of  box  in  descending  oroer 
^  l  se  scanline  sleeping  in  -v  direction  to 
transform  B  into  B  ,  which  represents 
a  --e!  i  t  maximum  hori/ontal  strips 
<■>  Shrink  ever,  txsx  in  B;(  bv  distanced  in  \ 
direc  t  ion 

7  Set  ssanline  at  right  edge  ol  laser,  sweep  in  the  »  direction 
to  verticallv  merge  boxes  'A  ith  y  direction  feature  si/e  less 
than  predehned  threshold  h  into  contracted  set  B, 

■S  ret  unit  B,  ). 


I  tie  tirsi  three  steps  in  the  above  algorithm  are  oriented  at  contracting  the  regions  in  the  v 
d, rectum  while  the  next  three  steps  are  for  contracting  in  the  .v  direction.  It  can  be  easily 
shown  that  at  ter  applying  the  above  algorithm  the  regions  salisl  v  the  definition  of  parallel  con- 
Maction  A  step  bv  step  example  ol  this  algorithm  is  illustrated  in  figure  4.12. 


4.4.4.  Other  applications 


4.4.4. 1.  Pattern  data  preparation  for  E-beam  lithography 


The  writing  speed  is  one  of  the  most  important  requirements  in  the  electron  beam  direct 
writing  used  in  the  VLSI  labrication  process.  The  vector -scan  method  has  high  potential  for  the 
high-speed  pattern  generation  because  it  does  not  have  to  scan  regions  where  no  pattern  exists. 
In  the  vector-scan  system,  however,  complicated  procedures  are  required  to  prepare  data  for 
writing.  This  is  because  the  patterns  which  can  be  written  in  such  systems  must  be  represented 
in  primitive  shapes,  such  as  rectangles.  Therefore,  the  designed  shapes  have  to  be  preprocessed 
in  order  to  produce  suitable  data  for  the  vector-scan  system.  An  important  step  in  preparing 


(d)  Maximum  Horizontal  Strips 


(h)  Maximum  Vertical  Strips 


(e)  \  Direction  Contraction 


(c)  Y  Direction  Contraction 


(f)  Vertical  Merge 


Figure  4.12  Modified  scanline  algorithm  for  contraction. 


pattern  data  is  to  eliminate  overlap  regions,  this  step  can  easily  be  achieved  by  the  algorithms 
described  in  the  previous  section 


4.4.4.2.  Compensation  for  proximity  effect 


The  proximity  effect  in  electron-beam  lithography  is  the  phenomenon  that  the  pattern 
data  receive  a  nonuniform  distribution  of  exposure  due  to  forward-scattered  and  backward- 
scattered  electrons  within  both  resist  and  substrate  regions.  It  will  produce  undesirable  shapes 
after  fabrication,  and  probably  alter  the  circuit  performance.  One  of  the  methods  to  compensate 
for  this  effect  is  to  resize  the  input  patterns  according  to  adjacent  geometries.  For  a  shape  with 
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denser  advent  patierns.  the  edges  ol  this  shape  lacing  these  patterns  should  be  mwardl-  dis¬ 
placed  'I'u  detect  the  adjacent  patierns  lor  a  particular  shape,  the  resizing  algorithm  described 
in  the  previous  section  can  be  applied  with  minor  modifications. 

4.4.4.3.  Compensation  for  pattern  generator  characteristic  error 

The  pattern  generator  used  to  make  layout  masks  has  its  own  characteristic  error,  result¬ 
ing  in  the  feature  size  being  different  from  the  original  design  By  applying  the  resizing  algo¬ 
rithm  to  the  input  data  for  the  pattern  generator,  such  errors  could  be  reduced. 

4.5.  Summary 

At  the  very  beginning  of  this  chapter,  we  describe  the  geometrical  data  structure  used  in 
HPEX.  namely  a  4-d  binary  search  tree.  The  advantage  4-d  binary  trees  have  is  the  efficiency 
of  region  and  point  searchings,  both  having  O ( log\ )  time  complexity.  Since  these  two  opera¬ 
tions  are  frequently  encountered  in  layout  verification  tools  such  HPEX.  employing  a  4-d 
binary  search  tree  in  the  extraction  program  should  reduce  the  overall  runtime  complexity. 
Following  a  description  of  the  4-d  binary  search  tree  and  its  supporting  algorithms,  we  give  a 
detailed  implementation  of  netlist  data  structure  used  in  HPEX.  The  main  feature  of  our  net- 
list  data  structure  is  that  geometric  and  circuit  schematic  information  are  strongly  combined 
together,  thereby  simplifying  extraction  of  circuit  parameters.  In  addition,  since  it  is  easy  to 
access  the  geometric  data  from  the  network  data,  optimizing  the  circuit  performance  by  adjust¬ 
ing  layout  geometries  is  made  feasible. 

Next,  a  new  layout  resizing  algorithm  based  on  scanline  approach  and  a  simple  rectangle 
data  structure  are  presented.  This  resizing  algorithm  has  many  applications,  such  as  compensa¬ 
tion  of  process  bias,  data  preparation  for  E-beam  lithography,  and  reduction  of  proximity 
effect.  Our  resizing  algorithm  treats  layout  expansion  and  contraction  differently  in  order  to 


improve  the  ethc  ieni.  .  o!  the  algorithm  H\  using  this  algorithm  !oi  circuit  extraction  *e  van 
etlivienlK  compensate  process  satiations  h\  c  out  rac  t  me  or  expanding  v  orre'-p-  ndmg  lac  out 
regions  Since  circuit  parameters  stronglx  depend  on  laxoul  :  eat  ure  sizes,  considering  the  actual 
I  eat  ure  sizes  alter  I  abdication  would  substantial!  \  imprm  e  the  ju  urac  i  in  c  ircuit  ext  rac !  i<  n 

I  he  data  structure  and  algorithms  mentioned  in  this  chapter  pros  ide  a  basis  lor  develop¬ 
ing  geometrical  extraction  algorithms  iri  HIM  X,  while  resistor  and  capacitor  models  derived  in 
Chapter  3  provide  a  computational  method  lor  interconnect  A’C  modeling  Beginning  in  Chapter 
5.  we  will  concentrate  on  circuit  extraction  and  parasitic  modeling  algorithms  used  in  1 1  PI- \ 
and  detailed  algorithms  of  hierarchical  and  Hat  circuit  extractions  used  in  HIM  \  will  also  be 
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extraction  time  Ivcaase  each  instance  <•  the  cells  has  to  tie  extracted  In  aooit  on  the  mem  - 
space  required  max  he  verv  large,  since  the  lav  out  is  stored  in  a  rial  representat .on  1  here!  -re 
:  r.  m  the  ex  t  rac  t  ion  time  a  nil -space  point  ot  .  lew  .  it  is  more  efficient  to  build  a  circuit  extract  - 
w  hich  is  able  to  exploit  the  hierarchs  :n  the  circuit  layout 

Although  hierarchical  extraction  can  sav  e  (Tl  time  and  memorv  space  the  actual  sav  mgs 
strongly  depend  on  the  regularity  factor  in  a  layout  design  This  regularity  (actor  is  defined  as 
the  ratio  ot  the  total  number  of  transistors  after  layout  flattening  to  the  number  ot  transistors 
actually  drawn.  If  this  factor  is  large  the  savings  can  tie  pronounced  1  urt  her  more,  the  output 
format  will  be  more  readable  if  the  hierarchv  of  a  circuit  lav  out  is  taken  into  account  <>ur 
objective  here  is  to  develop  an  efficient  hierarchical  parasitic  circuit  extractor 


In  the  literature,  several  hierarchical  circuit  extractors  have  been  reported  [52-54  4.55 
56]  Most  ot  these,  how  ever,  extract  onlv  parasitic  capacitances  [52 -55 1  Magic  s  !a\ out  extras 
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methodology  before  developing  an  extraction  program.  Several  methodologies  emphasizing 
different  aspects  have  been  proposed  [58],  Generally,  design  methodologies  fall  between  the 
following  two  extremes: 

( 1 )  Restricted  layout  hierarchy  : 

In  each  level  of  the  layout  hierarchy  cells  are  not  allowed  to  overlap,  and  geometric  prim¬ 
itives  also  are  not  allowed  to  overlap  any  cell.  Under  such  constraints,  cells  communicate 
with  the  environment  only  through  ports  which  are  defined  on  the  boundary.  Another 
restriction  is  that  no  partial  devices  can  be  formed  on  the  cell  boundary. 

(2)  Nonrestricted  layout  hierarchy  : 

No  constraints  have  been  made  or  assumed  in  this  methodology.  Cells  or  primitives  and 
cells  are  allowed  to  overlap  whenever  necessary.  The  designer  has  total  freedom  in  laying 
out  cells  and  placing  them  in  any  preferred  position.  In  general,  devices  are  allowed  to  be 
placed  right  on  the  cell  boundary. 

In  the  restricted  design  methodology,  the  cell  composition  rules  are  relatively  easy  because 
all  interactions  are  around  the  cell  boundary.  In  addition,  the  layout  hierarchy  is  exactly  the 
same  as  the  circuit  design  hierarchy.  This  makes  the  circuit  extraction  simple  and  straightfor¬ 
ward.  However,  this  style  of  layout  looks  clumsy  and  sometimes  conservative  due  to  overlap¬ 
ping  constraints.  In  the  nonrestricted  design  methodology,  on  the  other  hand,  the  designer  has 
total  control  over  how  to  lay  out  cells  and  use  them.  Nevertheless,  allowing  overlapping 
between  cells  often  makes  the  layout  hierarchy  different  from  its  corresponding  circuit  hierar¬ 
chy.  This  not  only  degrades  the  extractor  performance,  but  also  makes  the  consistency  check 
more  difficult.  In  general,  there  is  a  trade-off  between  the  adopted  design  methodology  and  the 
CAD  performance.  Sometimes  a  little  sacrifice  of  designers'  freedom  will  substantially  upgrade 
the  extractor  performance,  and  in  turn,  shorten  the  design  time  and  increase  productivity. 
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The  layout  methodology  we  choose  is  closer  to  the  restricted  design  hierarchy.  In  order  to 
produce  the  extracted  circuit  which  has  the  same  layout  hierarchy  as  specified  in  the  input 
description,  we  encourage  the  designers  to  design  circuit  layouts  with  no  overlaps  between  cells, 
and  between  cells  and  primitives.  However,  our  extractor  is  still  able  to  handle  overlapping 
cells  if  necessary.  A  simple  method,  described  as  follows,  is  used  to  to  resolve  the  cell  overlap¬ 
ping  problem.  After  the  CIF  input  file  is  read  in  by  HPEX,  a  tree-structure  hierarchy  as  shown 
in  Figure  5.1  is  retained  in  the  internal  data  structure.  The  minimum  bounding  box  (\1BB)  of 
each  cell,  which  is  defined  as  the  cell  boundary,  is  also  calculated  in  the  input  phase.  Suppose 
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that  two  overlapping  cells  were  detected;  we  could  flatten  one  level  of  hierarchy  for  each  cell 
and  continue  this  top-down  process  until  no  overlapped  cell  is  found.  This  method  can  also  be 
applied  to  the  case  when  one  cell  is  overlapped  by  the  geometrical  primitives.  The  worst  case  of 
this  method  is  that  the  whole  design  is  flattened  to  one  level,  and  circuit  extraction  is  per¬ 
formed  in  a  nonhierarchical  fashion.  Thus,  in  order  to  increase  the  performance  of  our  extrac¬ 
tor.  the  designers  are  encouraged  to  use  as  many  nonoverlapping  cells  as  possible  in  the  designs. 

5.3.  Hierarchical  Circuit  Extraction 

5.3.1.  Input  phase 

The  front  end  of  IIPEX  first  reads  in  the  layout  artwork  data  defined  in  the  C1F  descrip¬ 
tion  language,  preserving  the  layout  hierarchy.  Every  cell  in  the  internal  data  structure  con¬ 
tains  several  linked  lists  of  mask  levels  and  pointers  to  other  cells.  Figure  5.2  show's  the  imple¬ 
mentation  of  the  tree  hierarchy  as  shown  in  Figure  5.1. 

The  information  stored  in  the  cell  record  also  includes  the  cell  boundary  and  the  location 
and  mask  level  of  terminals.  Since  the  cell  boundary  is  not  explicitly  specified  in  the  input  file, 
the  cell  boundary  we  use  is  defined  as  the  minimum  bounding  box  (MBB)  which  covers  all 
primitives  and  subcells  of  this  cell.  The  MBB  is  computed  during  the  input  phase  after  all  ele¬ 
ments  are  read  in.  It  must  be  noted  that  some  user-extension  commands  are  defined  in  the  CIF 
file  in  order  to  clearly  indicate  the  cell  terminal  and  net  information.  This  is  due  to  the  fact 
that  in  the  original  CIF  layout  language  only  the  geometric  topology  is  described,  and  no  struc¬ 
tural  circuit  information  is  provided.  For  example,  power  and  ground  nodes  are  not  known  a 
priori,  if  we  do  not  provide  additional  information  in  the  CIF'  input.  This  kind  of  net  or  node 
information  will  be  greatly  useful  and  play  an  important  role  in  the  layout  verification  to  be 


performed  later. 


Figure  5.2  Internal  data  structure  of  HPEX  corresponding 
to  that  shown  in  Figure  5.1. 


5.3.2.  Analysis  of  layout  hierarchy 

In  order  to  adapt  the  input  layout  to  our  design  methodology,  the  layout  hierarchy  should 
be  analyzed  and  reconstructed.  The  main  purpose  in  reconstructing  the  hierarchy  is  to  remove 
all  possible  overlappings  between  instantiated  cells  and  cell  and  geometric  primitives.  The 
hierarchically  recursive  tree  walk  algorithms  employed  in  HPEX  for  restructuring  the  layout 
hierarchy  are  described  as  follows: 


RebuildTree(  A_Cell) 
begin 

1.  List  :=  A_Ceir.subcell: 
while  (List  <>  nil)  do 
begin 

Subcell  :=  ThisCell(List): 
if  (Subcell  is  not  checked  before)  then 
begin 

RebuildTree(Subcell): 

Set  Subcell  to  "checked"  status: 
end: 

List  :=  \extElement(List) 
end: 

end: 

2  RemoveCellOverlap( A_Cell); 
end: 

RemoveCellOverlap(  A_Cell) 
begin 

1  List  :=  A_Cell  . subcell: 
while  (List  <>  nil)  do 
begin 

Subcell  :=  ThisCell(List); 

if  (Subcell  overlaps  other  subcells  or 
geometric  primitives)  then 

begin 

Put  Subcell  in  a  list  ExpandList: 

Delete  Subcell  from  List: 

end; 

List  :=  \exlElement(List) 
end; 

2.  List  :=  ExpandList: 
while  (List  <>  nil)  do 

begin 

Subcell  :=  ThisCell(List); 

Expand  Subcell  in  A_Cell  by  one  level  of  hierarchy: 
List  :=  NexiElement(List): 
end: 

3.  if  (ExpandList  <  >  nil)  then 

RemoveCeJ10ver!ap(  A_Cel! ): 

end: 


All  steps  described  in  the  above  algorithms  are  self-explanatory.  The  procedure  "Rebuild- 
Tree"  hierarchically  removes  all  overlapping  subcells  inside  a  given  cell  by  a  bottom-up 
fashion,  while  the  procedure  "RemoveCellOverlap"  delects  and  expands  overlapping  cells  recur- 
siceK  Once  we  apply  the  procedure  "RebuildTree"  to  the  original  tree  hierarchy,  the  output 


tree  hierarchy  will  then  be  ready  for  the  next  step  processing. 


5.3.3.  Hierarchical  extraction 


The  layout  data  can  be  sent  to  the  hierarchical  extraction  module  after  the  input  layout  is 
read  in  and  preprocessed  by  HPEX.  The  hierarchical  circuit  extraction  algorithm  used  in  HPEX 
can  be  described  in  the  following  PASCAL-like  pseudocode: 

Circu..i  .  act(A_Cell) 
begin 

1  List  :=  A_Ceir.subcell: 
while  (List  <  >  nil)  do 
begin 

Subcell  :=  ThisCell(List); 

if  (Subcell  is  not  extracted  before)  then 
Circuit  Ex  tract(  Subcell): 

Set  Subcell  to  the  "extracted"  status: 

List  :=  NextElement(List) 
end; 

2.  Register  the  location  and  mask  level  of  segments 
for  corresponding  terminals  of  A_Cell; 

3.  List  :=  A_Ceir.subcell: 
while  (List  <  >  nil)  do 
begin 

Subcell  :=  ThisCell(List): 

Instantiate  all  terminal  segments  of  Subcell  in  A_Cell; 

List  :=  NextElement(List) 
end: 

4.  FlatExlract( A_Cell) 
end; 

Since  the  bottom-up  cell  processing  is  employed  in  the  hierarchical  circuit  extraction,  the  cell 
cannot  be  processed  unless  all  its  subcells  are  already  extracted.  Therefore,  for  a  particular  cell 
design  the  first  step  in  the  algorithm  is  to  check  if  all  its  subcells  are  already  extracted.  If  there 
are  some  subcells  which  are  not  extracted  before,  recursive  calls  for  circuit  extraction  will  be 
invoked  for  these  unextracted  subcells.  Otherwise,  this  cell  goes  on  to  the  next  step  for  further 
processing. 

After  step  1  in  the  above  algorithm,  the  cell  under  processing  is  ready  for  circuit  extrac¬ 
tion  However,  some  preprocessing  steps  must  first  be  taken  in  order  to  take  the  boundary 
information  of  subcells  into  consideration.  These  steps  are  primarily  for  the  registration  of 
information  around  the  cell  boundary.  The  cell  boundary  information  is  propagated  from  the 


lower  level  hierarchy  to  its  parent  cell  during  circuit  extraction,  including  the  segments  for  the 
subcell  terminals  instantiated  in  the  parent  cell  and  the  sequence  for  these  segments.  The  rec¬ 
tangle  data  structure  employed  in  our  extractor  easily  represents  any  segment  by  setting  either 
the  length  or  the  width  to  zero.  The  sequence  used  for  the  instantiated  segments  is  important 
and  will  be  used  to  register  the  external  node  numbers  for  the  subcircuit  calls  when  the  circuit 
extraction  is  finished. 

The  registration  of  cell  boundary  information  is  mainly  done  in  steps  2  and  3.  Step  2  is  to 
find  out  and  register  the  terminal  information  of  the  cell  currently  under  extraction  for  later 
references.  As  for  step  3.  all  terminal  information  about  the  subcells  is  instantiated  and 
registered  in  the  parent  cell  in  order  to  correctly  interpret  the  information  contained  in  the 
parent  cell.  In  this  instantiation  the  space  transformation  is  necessary  and  can  be  calculated 
according  to  translation,  rotation  and  mirroring  as  specified  in  the  C1F  input.  Since  we  only  deal 
with  Manhattan  type  layouts,  rotation  of  cells  other  than  multiples  of  90  degree  is  prohibited, 
and  mirroring  of  cells  is  allowed  only  with  respect  to  the  x  —  or  y—  axis.  It  should  be  noted  that 
in  step  3  new  geometries  corresponding  to  all  terminal  segments  of  subcells  are  also  created  and 
added  to  the  cell  under  processing  for  the  terminal  registration.  Once  these  preprocessing  steps 
are  finished,  the  cell  is  then  sent  to  the  flat  extractor  module  for  the  transistor  and  parasitic 
extraction. 

5.4.  Flat  Circuit  Extraction 

f  lat  circuit  extraction  in  HPEX  can  be  illustrated  by  the  flowchart  shown  in  Figure  5.3. 
The  artwork  data  is  first  preprocessed  according  to  process  variations  by  layout  resizing  algo¬ 
rithms  described  in  Chapter  3  .  By  applying  boolean  mask  operations.  MOS  transistors  and 
interconnection  geometries  are  efficiently  and  correctly  identified.  This  step  is  called  geometri¬ 
cal  extraction,  generally  consisting  of  two  major  steps:  (11  finding  active  devices  from  certain 
combinations  of  mask  layers  specified  in  the  technology,  and  (2)  identifying  electrically 
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Figure  5.3  Flowchart  for  HPEX  to  generate  SPICE  file. 


connected  nets.  Following  this,  detailed  interconnect  parasitics  are  extracted  by  feeding 
geometrical  nets  into  the  RC  modeling  module.  Finally,  an  accurate  node  reduction  algorithm 
is  applied  to  reduce  the  number  of  parasitic  elements  In  this  chapter,  we  will  mainly  focus  on 
general  flat  circuit  extraction  procedures  and  the  RC  modeling  The  detailed  node  reduction 
method  will  be  discussed  in  the  next  chapter 
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5.4.1.  Extraction  procedures 


In  HPEX,  flat  circuit  extraction  procedures  can  be  summarized  as  follows: 

Circuit  extraction  steps 

El.  Read  all  geometric  primitives  specified  in  the  cell. 

E2.  Resize  the  input  layout  data  by  user's  specified  process  bias  and  construct  4-d  binary 
search  trees  [45]  for  each  mask  level; 

E3.  Find  all  overlaps  between  diffusion  and  poly  rectangles  to  determine  transistor  channels 
by  excluding  butting  and  buried  contacts,  then  remove  all  channels  from  diffusion  rectan¬ 
gles: 


E4  Use  depth-first  search  to  group  channels  which  belong  to  one  transistor  and  from  transis¬ 
tor  channels  find  rectangles  associated  with  the  drain  or  the  source: 

E5.  Use  information  about  transistor  source,  drain  and  gate  to  find  all  nets  by  depth-first 
search  through  all  electrically  connected  rectangles; 

E6.  Find  all  power  and  ground  nets  from  user-specified  coordinates  in  CIF  input  file: 

F7.  Feed  all  nets  except  power  and  ground  nets  into  interconnect  RC  model  module  to  com¬ 
pute  associated  resistances  and  self-capacitances; 


F&.  Compare  nets  pairwise  to  compute  all  possible  coupling  capacitances: 

F9.  Find  all  i,  o.  power,  or  ground  node  numbers  from  user-specified  coordinates; 


1. 10  Perform  the  node  reduction  on  some  of  the  nets: 

Fll  Report  a  network  composed  of  circuit  elements,  such  as  transistors,  resistors  and  capaci¬ 
tors.  and  ns  i/o,  power  and  ground  node  numbers. 
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In  the  extraction  step  E2.  the  4-d  binary  trees  for  diffusion  and  poly  regions  have  to  be 
updated  because  channel  regions  have  to  be  removed  from  diffusion  and  polv  trees.  In  this  way. 
not  only  transistors  with  simple  rectangle  gates  but  also  those  with  circular  or  serpentine  gates 
can  be  identified.  An  example  illustrating  how  to  update  diffusion  and  poly  trees  is  shown  in 
Figure  5.4  where  new  diffusion  rectangles  C  and  D  and  poly  rectangles  E.  F  and  G  need  to  be 
inserted  in  the  trees  while  two  original  rectangles  A  and  B  are  marked  inactive.  If  there  is  a 
long  poly  rectangle  intersecting  several  diffusion  rectangles,  the  decomposition  of  this  poly  rec¬ 
tangle  into  smaller  rectangles  is  required  in  order  to  separate  channels  from  interconnect  polys. 
In  the  decomposition  algorithm,  the  quick  sort  algorithm  is  first  applied  to  find  an  increasing 
order  of  channel  segments,  and  then  by  this  order  we  can  easily  decompose  the  original  rectan- 


5.4.2.  RC  modeling  module 


In  interconnect  RC  modeling,  the  first  step  is  to  search  for  all  nets  in  which  each  net  con¬ 
tains  many  rectangles.  Then  from  the  net  information  an  RC  network  model  will  be  generated 
in  step  E6.  If  all  resistances  are  neglected  in  one  net.  all  rectangles  in  this  net  should  be  equipo- 
tential.  But  for  detailed  modeling  of  the  electrical  behavior  of  the  interconnects,  resistances 
must  be  included.  The  difficulty  associated  with  the  resistance  extraction  is  that  resistances  are 
strongly  dependent  on  the  current  flow  in  one  net.  However,  the  information  about  current 
flows  is  unknown  before  actually  simulating  the  circuit.  In  order  to  achieve  greater  accuracy 
we  therefore  have  to  preprocess  mask  data  information  in  the  net  since  only  simple  resistance 
formulas  are  used  in  our  extractor.  In  preprocessing  mask  data,  the  rectangles  which  is  electri¬ 
cally  abutted  by  other  rectangles  are  decomposed  into  a  series  of  rectangles.  The  rectangle 
decomposition  algorithm  we  use  is  described  as  follows: 
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Insert  new  rectangles  E.  F  and  G  into  the  poly  tree, 
and  new  rectangles  C  and  D  into  the  diffusion  tree 

Figure  5.4  Update  diffusion  and  poly  trees. 


Rectangle  Decomposition  Algorithm 

Input  :  a  rectangle  A  in  one  net. 

Output  :  decomposition  of  the  rectangle  according  to  its  environment 
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begin  _ 

Rl.  Suppose  that  segment  A^A-,  is  the  wider  side  _  _ 

of  rectangle  .4  Find  all  butting  segments  B  XB2.  B3BA  •  •  and  Bh -\Bn 

according  abutting  rectangles  of  the  rectangle  .4 

R2.  Merge  and  Sort  segments  B  XB 2.  B 2B A . 

and  Bn  _ , Bn  into  an  increasing  order  of  segments 
C,C2.C3C4.  ...  and  CL  _,C;  . 

R3.  From  segments  C  XC2.  C3C4 . 

Ck  _tCk  and  AXA2  decompose  the  rectangle  A  . 

end; 


As  an  example.  Figure  5  5  shows  how  the  rectangle  can  be  decomposed  into  several  rectangles 
bv  the  above  algorithm.  Notice  that  the  quick  sort  algorithm  is  used  in  step  R2. 

Before  turning  to  the  algorithm  for  branch  creation,  we  need  some  definitions.  If  two  rec¬ 
tangles  have  more  than  one  electrically  connected  point,  they  are  defined  to  be  electrically  con¬ 
nected.  If  the  rectangle  electrically  connects  more  than  two  rectangles,  it  is  said  to  be  a  knot. 
Also,  the  rectangles  associated  with  the  drain,  source  and  gate  of  any  transistor  are  defined  as 
ports,  which  need  not  be  decomposed.  For  convenience,  we  define  the  rectangle  which  has  only 
one  electricallv  connected  rectangle  as  a  knot,  since  a  node  number  should  be  created  for  this 
rectangle  in  modeling  interconnect  parasitics.  After  knots  and  ports  are  defined  in  one  net,  they 
are  assigned  different  integer  numbers.  The  branch  creation  algorithm,  described  below,  is  then 
applied  to  find  all  branches  in  the  net. 

Branch  Creation  Algorithm 
Input  :  a  net  ,V 

Output  :  all  branches  in  this  net  A 
begin 

Bl.  Create  an  electrically  connected  graph  based  on  rectangles  in  the  net  A; 
in  this  graph,  every  node  represents  a  rectangle  and  edges 
indicate  interconnect  information  between  rectangles 

B2.  From  any  port  (or  knot)  with  unmarked  edges,  find  every  path  ending 
at  the  other  port  (or  knot):  and  define  this  path  as  a  branch. 

Then  mark  all  traversed  edges. 

B3  Find  another  port  (or  knot)  with  unmarked  edges,  then  go  to  step  B2; 
otherw  ise,  return. 

end; 
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In  Figure  5.6.  we  have  an  example  to  show  how  to  use  the  above  algorithm  to  create  all 
branches  in  a  net.  Every  branch  consists  of  a  list  of  rectangles  in  one  path,  and  this  information 
is  extremely  useful  in  modeling  interconnect  parasitics.  It  should  be  mentioned  here  that  the 
information  about  all  branches  in  one  net  is  stored  in  the  branch  pointer  in  the  net  record. 


2  and  9  knot 

A.  B.  C  .  and  D  :  port  or  knot 


Electrically  Connected  Graph 


Created  Branches 

( 1 )  A  1  2 

(2)  2  3  4  5  D 

(3)  2  6  7  8  9 

(4)  9  10  B 

(5) 9  11  12  13  C 

Figure  5.6  Branch  creation. 


The  next  step  is  to  perform  resistance  and  capacitance  calculations.  However,  corner  rec¬ 
tangles  have  to  be  first  detected  in  order  to  accurately  model  resistance  of  bends.  These  corner 
rectangles  could  be  easily  found  as  long  as  two  adjacent  rectangles  are  known.  In  order  not  to 
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overcount  resistance,  resistances  of  the  rectangles  associated  with  ports  and  knots  are  also  care¬ 
fully  approximated.  For  knots  and  gate  ports  only  half  of  the  resistance  is  taken  into  account  in 
resistance  calculation,  while  for  diffusion  ports  the  resistance  is  not  computed  due  to  the  fact 
that  a  circuit  simulator  such  as  SPICF2  usually  will  generate  this  kind  of  information  inter¬ 
nally.  Once  the  resistances  of  all  rectangles  in  one  branch  are  known,  their  summation  is 
clearly  the  resistance  of  the  branch.  For  self-capacitance  calculation,  empirical  formulas 
described  in  the  preceding  chapter  are  used:  however,  equivalent  dielectric  constants  are  taken 
into  formulas  if  there  are  layers  of  different  dielectrics.  \ote  that  self-capacitance  of  one 
branch  is  calculated  when  computing  the  branch  resistance.  Finally.  7r-lumped  circuit  model  is 
used  to  approximate  distributed  RC  behavior  of  all  branches.  Another  feature  of  this  step  is 
that  threshold  values  Rlh  and  CsW/rA  are  specified  to  filter  out  trivial  resistances  and  capaci¬ 
tances.  Therefore,  all  nodes  with  zero  resistance  in  between  have  to  be  collapsed  into  one  node: 
the  depth-first  search  algorithm  is  used  to  find  these  nodes. 

Before  calculating  the  coupling  capacitance  between  two  nets,  their  minimum  bounding 
boxes  are  compared  to  decide  whether  the  coupling  is  too  small  to  calculate.  Since  branches  are 
basic  elements  in  calculating  the  coupling  capacitances,  their  minimum  bounding  boxes  are  also 
compared  in  order  to  filter  out  small  couplings.  Once  the  coupling  capacitance  between  two 
branches  is  determined,  the  coupling  capacitor  is  inserted  between  them  through  the  T-like 
lumped  circuit  model.  The  reason  to  use  this  model  is  that  it  is  easier  to  insert  the  coupling 
capacitance,  and  at  most  one  more  node  needs  to  be  created  in  each  branch.  Note  that  since  the 
self-capacitance  calculation  is  separated  from  the  coupling  capacitance  calculation  in  our 
approach,  some  errors  will  be  produced  because  both  capacitances  are  closely  related.  A  general 
increase  in  the  coupling  capacitance  decreases  the  self  capacitance.  However,  the  effect  of  this 
trend  is  considered  as  second  order  and  should  not  affect  the  circuit  performance  significantly. 


.1  1.1 


5.5.  Examples  and  Discussion 


All  algorithms  and  techniques  described  in  the  previous  sections  have  been  implemented 
in  HPEA.  The  program  is  coded  in  PASCAL,  which  runs  on  a  SUN  3/75  workstation  using  the 
UNIX  operating  system  and  has  about  11000  lines  of  codes,  including  program  statements  and 
documentation 


5.5.1.  Example  1:  Extraction  and  simulation  of  NJAIOS  one-bit  full  adder 

A  schematic  circuit  diagram  of  a  full  adder  is  shown  in  Figure  5.7.  This  circuit  was  laid 
out  in  3/am  N'MOS  technology.  Before  extraction,  several  process  dependent  parameters  must  be 


CARRY 


Figure  5.7  An  N'MOS  one-bit  full  adder. 


.» 


without  parasitics 
with  only  capacitances 
with  all  parasitics 


Time  (ns) 


Figure  5.8  Simulation  output  waveforms  for  one-bit  full  adder: 
(a)  without  interconnect  parasitics:  (b)  with  all 
interconnect  parasitics;  (c)  with  only  capacitances. 


input  waveforms  A.  B.  and  C.  There  are  three  output  waveforms  for  SI  'M  and  CARRY  The 
solid  lines  represent  the  waveforms  without  interconnect  parasitics,  while  the  dotted  lines  are 
the  simulation  output  of  the  extracted  circuit  including  all  interconnect  parasitics.  In  addition, 
the  circuit  with  only  interconnect  capacitance  is  simulated  in  order  to  estimate  the  effect  of 
interconnect  resistance.  These  output  waveforms  are  almost  identical  to  those  with  all  intercon¬ 
nect  parasitics.  Basically,  the  three  output  waveforms  are  very  similar  in  shape,  except  for  sig¬ 
nal  delays  which  are  different.  The  longest  delay  is  observed  in  the  circuit  with  all  intercon¬ 
nect  parasitics.  If  the  coupling  capacitances  are  large  enough,  sometimes  glitches  or  spikes 
might  appear  in  the  output  waveform.  But  this  is  not  the  case  in  this  example.  Thus  it  can  be 
concluded  that  the  coupling  capacitances  are  too  small  to  seriously  affect  the  performance  of 
this  circuit  Furthermore,  the  interconnect  resistances  also  have  no  major  effect  on  circuit  per¬ 
formance  and  therefore  can  be  neglected  in  this  example 

5-5.2.  Example  2:  CMOS  PLA 

The  PI. A  layout  of  this  example  is  generated  by  PANDA  [37],  realizing  the  following 
I  unction. 

T  =  r  (  Tv  +  xy  )  +  (  xy  +  tv  )5 
c  =  xy  +  v5  +  v5 

where  t  ,v  and  c  are  inputs,  and  s  and  c  are  outputs  Although  PLA  circuits  produced  by 
PANDA  are  implemented  in  the  CMOS  technology,  they  employ  NMOS  circuit  design  concept, 
i.e  most  PMOS  transistors  are  used  as  pull-up  resistors  and  their  gates  are  tied  to  ground.  The 
extracted  circuit  of  this  example  with  all  interconnect  parasitics  consist  of  15  p-lype  transis¬ 
tors.  n-type  transistors.  114  resistors  and  150  capacitors.  For  comparison,  two  other  circuits 
with  different  details  ol  interconnect  parasitic  are  also  extracted  Simulation  results  showed 
that,  in  this  example,  the  effect  of  interconnect  parasitics  is  still  too  small  to  seriously  affect  the 
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5.5 .3.  Example  3:  NMOS  serial  adder 

The  layout  of  this  serial  adder  is  obtained  from  [38].  It  is  realized  in  3^tm  NMOS  technol¬ 
ogy  and  its  logic  diagram  is  shown  in  Figure  5.9.  The  extracted  output  contains  41  transistors. 
90  resistors  and  117  capacitors.  Simulation  results  show  that  interconnect  resistances  are  too 


small  to  degrade  the  timing  performance,  while  interconnect  capacitances  indicate  a  significant 
impact  on  the  signal  delay,  l  or  one  particular  input  vector,  the  critical  path  delay  increases 
from  18.6  ns  to  22.55  ns  by  adding  all  parasitic  capacitances. 

F  rom  the  above  three  examples,  we  conclude  that  the  effect  of  interconnect  parasitics  on 
circuit  performance  is  not  only  layout-dependent,  but  also  technology-dependent.  Signal 
delays  in  circuits  are  usually  affected  by  self-capacitances  and  resistances  However,  coupling 
capacitances  might  produce  glitches  or  spikes  and  their  magnitude  is  heavily  dependent  on  the 
actual  circuit  layout.  In  addition,  the  effect  of  capacitive  coupling  noise  is  not  known  until  the 
extracted  output  is  simulated.  Therefore,  in  circuit  designs  one  must  have  the  ability  to  esti¬ 
mate  the  coupling  capacitances  between  signal  lines,  and  take  their  effect  into  account. 

5.5.4.  Example  4:  Hierarchical  extraction  of  NMOS  half -shift  register 

An  example  circuit  layout  for  a  half-shift  register  and  its  corresponding  extracted  output 
appear  in  Figure  5.10.  Since  the  output  has  the  same  format  as  the  SPICE  input,  we  can  easily 
perform  circuit  simulations  on  this  output.  A  point  to  be  noted  here  is  that  the  process  file 
which  contains  process  information  such  as  oxide  and  conductor  thicknesses  is  separated  from 
the  executable  file.  As  a  result,  different  process  parameters  for  any  given  layout  can  be  easily 
substituted  to  predict  performance  variation  due  to  different  process  lines. 

Runtimes  of  a  set  of  idealized  layouts  which  are  composed  from  the  dynamic  shift  register 
shown  in  Figure  5.10  are  listed  in  Table  5.1.  Each  successive  circuit  layout  consists  of  four 
instances  of  the  previous  layout  with  no  overlapping  cells.  The  runtime  for  extraction  with 
coupling  capacitances  is  proportional  to  the  perimeter  of  the  cells  plus  the  factor  due  to  check 
ing  steps  in  handling  the  layout  hierarchy  and  calculating  coupling  capacitances.  However,  the 
runtime  for  extraction  without  coupling  capacitances  is  proportional  to  the  perimeter  of  the 
cells  and  the  factor  only  due  to  preprocessing  steps  in  handling  the  layout  hierarchy.  It  is 
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Table  5.1  Runtimes  f  or  different  compositions  of 
a  half-shift  register. 


Layout 

Flat 

transistors 

Runtime  (sec) 

( with  coupling 
capactances) 

Runtime  (sec) 
(without  coupling 
capacitances) 

HalfShift 

3 

2.37 

2.12 

2x2 

12 

2.82 

2.43 

4x4 

48 

3  97 

3.38 

8x8 

192 

6.63 

5.33 

16x16 

768 

14.22 

8.65 

32x32 

3072 

37.15 

16.92 

64x64 

12288 

113.18 

35.08 

128x128 

49152 

387.85 

75.15 

256x256 

196608 

1427.28 

172.90 

observed  that  the  runtime  for  extraction  with  coupling  capacitances  becomes  pronounced  com¬ 
pared  to  that  for  extraction  without  coupling  capacitances  when  a  layout  has  a  large  number  of 
transistors  and  a  deep  hierarchy  This  phenomenon  can  be  attributed  to  the  fact  that  time  com¬ 
plexity  O(.V').  where  A'  is  the  number  of  nets,  for  computing  coupling  capacitances  plays  a 
deciding  role  in  the  total  time  complexity  when  the  layout  is  significantly  large 


5.5.5.  Example  5  :  Flat  extraction  of  NMOS  one-bit  full  adder 

lable  5?.  shows  the  runtimes  of  flat  extractions  for  different  numbers  of  one-bit  full 
adder  circuits  as  illustrated  in  Example  1.  Note  that  the  number  of  capacitors  as  shown  in  the 
table  contains  both  seif  capacitances  and  coupling  capacitances.  Experimental  data  indicate  that 
time  complexity  (MA7og(.V  J).  where  .V  is  the  number  of  boxes,  of  4-d  binary  search  trees  [45] 
used  in  geometrical  extraction  will  dominate  extractor  performance  Therefore,  in  order  to 
improve  extractor  performance,  the  scanline  approach  for  geometrical  circuit  extraction  is 
recommended  since  Us  observed  time  complexity,  in  general,  is  (HN  )  where  A'  is  the  number 
of  boxes  in  the  lav  out  Another  observation  is  that  for  moderate  sizes  of  cells  the  runtime  for 
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Table  5.2  Runtimes  for  different  numbers  of  one-bit  full  adder. 


Trans. 

. 

Capac. 

Resis. 

Preproc. 

time  (sec) 

Geom 

extr. 

time  (sec) 

Interconn, 
modeling 
time  (sec) 

Total 

time 

(sec) 

21 

104 

66 

1.92 

3.57 

9.88 

15.37 

42 

212 

134 

2.45 

9.23 

19  57 

30.45 

63 

318 

202 

3.13 

16.82 

28.92 

48.87 

84 

424 

270 

3.87 

26.25 

38.82 

68.93 

105 

530 

338 

4.67 

37.68 

48.40 

90.75 

126 

636 

406 

6.18 

56.78 

64.90 

127.87 
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interconnect  modeling  is  almost  linearly  proportional  to  the  number  of  boxes  specified  in  the 
circuit  layout,  although  some  algorithms  in  modeling  interconnects  require  0(N~)  time  com¬ 
plexity.  where  N  is  the  number  of  nets  in  the  layout.  From  this  observation,  we  conclude  that 
the  ideal  input  for  our  extractor  would  be  a  layout  composed  of  moderate  sizes  of  cells,  each  of 
them  containing  a  few  hundred  tran  .stors. 


5.6.  Comparison  with  Magic’s  Layout  Extractor 

Magic  is  an  interactive  layout  system  developed  by  J.  K.  Ousterhout  et  al.  [59]  for  MOS 
custom  integrated  circuits.  The  system  incorporates  a  set  of  design  and  verification  tools:  a  lay¬ 
out  editor,  a  continuous  design  rule  checker,  a  plowing  tool,  a  router,  and  a  circuit  extractor. 
Since  circuit  extraction  is  our  main  concern,  we  only  discuss  the  comparison  between  HPFX  and 
Magic  s  extractor  in  this  section. 

In  order  to  compare  HPFX  with  Magic's  extractor,  some  circuit  layouts  have  been  edited 
and  extracted  in  the  Magic  layout  system.  These  layouts  are  then  extracted  by  HPFX.  From  our 
experience  with  HPFX  and  Magic's  extractor,  several  differences  between  these  two  extractors 
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are  observ  ed: 
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( 1 )  Computer  Language  *  HPEX  is  written  in  PASCAL,  while  Magic's  extractor  is  coded  in  C. 


(2)  Data  Structure  -  HPEX  uses  a  4-d  binary  search  tree  for  the  internal  data  structure,  while 


Magic's  extractor  is  based  on  a  data  structure  called  corner  stitching  [60],  From  the  CPU 


time  point  of  view,  it  indicates  that  Magic's  extractor  runs  faster  than  HPFX.  This  is 


because  corner  stitching  is  more  efficient  for  geometrical  searching  operations  in  the  con¬ 


nectivity  extraction.  Furthermore,  the  code  in  HPFX  is  not  optimized.  However.  Magic's 


extractor  needs  more  memory  space  due  to  the  nature  of  the  corner  stitching  data  struc¬ 


ture;  the  entire  layout  space  including  both  layer  and  empty  spaces  is  explicitly 


represented. 


13)  Circuit  Model  -  HPFX  describes  the  connections  between  transistors  as  a  detailed  RC  net¬ 


work.  The  extracted  "transistor  netlist  along  with  detailed  RC  networks  are  used  for  tim¬ 


ing  analysis  or  detailed  circuit  simulation  A  simpler  circuit  model  is  adopted  in  Magic's 


extractor  It  models  transistor  interconnections  as  nodes.  A  node  is  like  an  equipotential. 


but  it  includes  a  lumped  parasitic  resistance  and  self  capacitance.  Although  this  node 


model  is  fairly  simple  and  efficient,  it  neglects  the  distributed  behavior  of  interconnection 


lines  by  only  computing  a  lumped  resistance.  In  addition,  a  very  simple  approximation  is 


used  in  Magic's  extractor  to  calculate  lumped  resistances.  This  approximation  can  be 


described  as  follows.  The  total  perimeter  and  total  area  of  each  type  of  layer  comprising  a 


node  are  first  computed.  Then  the  node  is  assumed  to  be  a  simple  rectangular  region.  The 


resistance  is  simply  obtained  by  solving  a  quadratic  equation  from  the  perimeter  and  area. 


Although  this  approximation  is  straightforward  in  computation,  it  performs  poorlv  for  a 


node  with  many  branches,  for  example,  an  interconnect  layout  and  its  corresponding 


extracted  resistance  networks  as  shown  Figure  5.11  It  clearly  indicates  that  the  HPFX 


output  network  is  more  accurate  compared  to  the  resistance  network  generated  b\  Magic 
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overlapping  cells.  All  overlapped  cells  have  to  b<*  flattened  in  this  type  of  extraction. 
Magic's  extractor,  on  the  other  hand,  allows  overlapped  cells  as  long  as  no  transistors  are 


treated  or  destroyed.  The  reason  for  more  flexible  extraction  in  Magic  is  that  the  parasitic 
extraction  of  Magic's  extractor  based  on  a  node  model  is  easier  to  implement  as  compared 
to  that  of  HPEX 


(5)  Output  Format  -  HPFX  produces  the  output  which  is  exactly  SPICE-compatible.  How¬ 
ever.  Magic's  extractor  generates  its  own  output  formal.  If  a  timing  analysis  program  such 
as  CRYSTAL  is  used  to  locate  a  critical  path,  the  Magic  extracted  format  should  be 
translated  into  the  CRYSTAL  input  by  a  program  called  extlsim.  In  this  translation,  all 
extracted  cells  are  expanded  into  a  file  consisting  of  transistors,  resistors  and  capacitors. 
Yo  hierarchical  information  is  retained  in  the  output  by  such  translation,  and  this  is  con¬ 
trary  to  the  HPEX  output.  There  is  also  a  program  called  simlspice  which  transforms 
the  CRYSTAL  input  into  the  SPICE  input.  Unfortunately,  this  program  does  not  recognize 
the  resistance  elements  Therefore,  in  the  SPICE  input  file  all  resistances  •  discarded. 
This  further  degrades  the  accuracy  of  the  interconnect  model. 

In  summary.  HPEX  is  better  in  terms  of  the  parasitic  circuit  model,  but  Magic's  extractor 
performs  well  in  terms  of  speed.  One  suggestion  tor  future  improvement  may  be  the  implemen¬ 
tation  of  HPEX  parasitic  models  in  Magic  or  the  implementation  of  the  earner  stitching  data 


structure  in  HPEX. 
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CHAPTER  6. 


NODE  REDUCTION  TECHNIQUE 


6.1.  Introduction 


Algorithms  to  extract  lumped  RC  circuits  from  the  layout  of  interconnect  regions  have 
been  developed  in  Chapter  5.  The  number  of  extracted  parasitic  elements  strongly  depends  on 
the  actual  physical  layout  and  the  lumped  RC  circuit  models  used  in  circuit  extraction.  How¬ 
ever.  in  general,  a  large  number  of  parasitic  elements  as  compared  to  the  number  of  active  dev¬ 
ices  will  be  extracted  after  circuit  extraction.  In  order  to  predict  the  variation  of  the  timing 
performance  caused  by  introducing  parasitic  elements,  the  transistor  netlist  along  with  the 
parasitic  lumped  RC  networks  have  to  be  simulated  by  circuit  simulators.  If  we  include  all 
these  elements  in  the  circuit  simulation  file,  the  simulation  lime  will  be  increased.  Further¬ 
more.  from  the  verification  standpoint,  it  is  very  difficult  to  pinpoint  and  isolate  the  effect  of 
parasitic  elements  when  we  simulate  the  circuit  file  including  all  interconnect  parasitics.  For 
example,  if  after  circuit  simulation  we  find  that  there  is  a  liming  performance  degradation  due 
to  some  parasitic  elements,  delecting  these  elements  would  be  sometimes  impossible  because  of 
a  large  number  of  parasitic  elements  included  in  the  simulation  file..  Therefore,  by  taking  the 
simulation  time  and  verification  effort  into  account,  in  parasitic  extraction  we  need  a  node- 
reduction  technique  which  is  able  to  accurately  reduce  complicated  parasitic  RC  networks  into 
simple  lumped  circuit  networks. 

A  brute-force  approach  for  the  node  (or  element)  reduction  in  the  extracted  circuit  output 
has  been  proposed  by  Bastian  et  al.  [5].  In  their  approach  only  rule-of-thumb  heuristics  are 
applied  to  reduce  the  number  of  parasitic  elements.  For  instance,  capacitance  and  resistance 
threshold  values  are  set  to  filter  out  some  elements  with  smaller  values.  The  positions  of 


■r.  -\  <e  v.  -■  v . 
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resistors  or  capacitors  are  flipped  in  order  to  combine  two  resistors  or  capacitors.  Although  this 
type  of  approach  is  simple  and  acceptable  m  the  sense  of  locality,  it  often  produces  somewhat 
inaccurate  results  if  we  take  the  global  RC  network  into  account.  One  such  example  is  that  the 
summation  of  a  large  number  of  small  resistances  may  have  a  strong  effect  on  the  signal  delay, 
regardless  of  the  small  values  of  individual  resistances.  Instead  of  using  pure  heuristics  we 
must  consider  the  signal  delay  of  RC  networks  in  developing  an  accurate  node-reduction  tech¬ 
nique.  Signal  delay  in  RC  trees  or  general  RC  networks  has  recently  attracted  much  attention 
and  interest  [61-62]  because  of  its  application  in  timing  analysis  of  digital  MOS  integrated  cir¬ 
cuits.  where  MOS  transistors  are  approximated  by  linear  resistors.  Generally,  the  signal  delay 
through  RC  circuits  cannot  be  calculated  in  closed  form.  Without  using  a  circuit  simulation 
technique,  two  approaches  are  proposed  to  analytically  estimate  the  signal  delay  through  RC 
networks  The  first  one  [6 1 )  finds  closed-form  delay  bounds  at  output  node  in  RC  networks, 
while  the  second  one  [62]  roughly  uses  a  time  constant  to  approximate  the  signal  delay  through 
RC  networks  The  advantage  of  the  first  method  is  that  the  error  bound  can  be  predicted  if 
delay  bounds  are  applied  to  estimate  the  timing  information.  However,  one  time-constant 
approach,  though  hard  to  predict  the  error  bound,  is  simple  in  the  delay  calculation. 

In  this  chapter,  a  new  accurate  node-reduction  technique  based  on  a  single  time-constant 
approach  to  reduce  RC  tree  networks  is  presented.  By  applying  this  technique,  not  only  the 
number  of  parasitic  elements  can  be  reduced,  but  also  the  delay  effect  caused  bv  parasitics  can 
be  accurately  retained  This  chapter  is  organized  as  follows  In  order  to  describe  the  node- 
reduclion  problem  some  definitions  associated  with  a  linear  RC  network  are  first  given  Fol¬ 
lowing  this  the  node  reduction  method  lor  a  liming  estimation  is  described.  Based  on  the  results 
derived  lor  timing  estimation,  we  then  develop  a  heuristic  node-reduction  algorithm  for  the 
purpose  of  circuit  simulation 


6.2.  Definitions 


Definition  6 ./  £C  free 

A  tree  network  of  resistances  on  N+l  nodes,  not  including  the  ground  node,  with  node  0 
designated  as  the  root  and  denoting  the  capacitance  connected  between  node  k  and  ground,  is 
called  an  RC  tree. 

Definition  6.2  Rtj  in  an  RC  tree 

Let  Pt  be  the  path  between  the  root  and  node  i  in  the  RC  tree.  Define  P  to  b e  P,  H  Pj  ■ 
Then  R:J  is  defined  as  the  sum  of  resistances  in  path  P  .  If  there  is  no  intersection  between 
paths  P ,  and  PJ .  then  Rtj  is  zero. 

Definition  6.3  Elmore's  time  constant  [63] 

In  a  linear  RC  network  with  zero  initial  charge.  Elmore's  time  constant  of  node  k  is 
defined  as 

oo 

Tdl  =  fty\(t)dt  .  (6.1) 

o 

where  y\  ( t )  is  the  derivative  of  the  transient  response  yt  (t )  of  node  k  in  the  network.  For  an 
RC  tree.  Elmore's  time  constant  of  node  k  can  be  derived  as 

A  V 

T oi  =£  (6.2) 

i  =  i 

Definition  6.4  P-R  bounds  of  an  RC  tree  [6 1  ] 

Consider  a  unit  step  input  applied  to  the  root  node  in  an  RC  tree  at  time  t=  0.  The  fol¬ 
lowing  lower  and  upper  bounds  va  ( t  )  and  \’ui  ( t  )  of  the  voltage  waveform  ( t )  at  output  node 
k  are  defined  as  P—R  bounds  . 
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w  here  T Dk  is  Elmore  s  lime  constant,  and  Tp  and  Tkk  are  defined  as  follows: 


TP  =1  R„C:  . 


A  V  ^,‘C, 

TRl  =1  - 

.=1 


6.3.  Node  Reduction  for  Critical  Path  Timing  Estimation 


6.3.1.  Problem  formulation 


The  problem  we  consider  here  can  be  described  as  follows.  Let  N  denote  a  driver  plus  an 
RC  tree  network  with  the  driver  input  node  and  a  set  O  of  output  nodes.  For  the  liming  esti¬ 
mation  of  a  given  output  node  k  €  O  we  wish  to  replace  the  above  network  with  a  network  NA 
consisting  of  a  driver  plus  a  simple  RC  network  with  only  one  resistance  R^  u  ,  between  the 
root  node  and  node  k  and  one  capacitance  C^/(k ,  (between  node  k  and  ground.  The  criterion  for 
choosing  Riylk )  and  C1  {i  ,  is  that  the  response  at  node  k  due  to  a  unit  step  input  at  the  input  of 
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driver  in  should  "closely"  match  the  corresponding  response  in  the  original  network  N.  Fig¬ 
ure  6.1  shows  an  original  RC  tree  and  its  reduced  network  used  for  timing  estimation. 
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6.3.2.  Previous  approach 

A  wavetorm-bound  technique  has  been  proposed  to  reduce  an  RC  tree  in  a  timing  estima¬ 
tion  program  called  AL'TODELAY  [64],  The  node-reduction  procedures  used  in  AUTODELAY 
can  be  summarized  as  follows: 


( 1 )  For  a  given  voltage  threshold  Vr .  the  propagation  delay  ,  is  approximated  by  finding 

the  cross  point  between  the  mean  response  of  P-R  bounds  and  this  given  threshold  Vr. 
namely,  solving  the  following  equation  for  time  t : 


( 4  )  Compute  >  and  C^u  ,  by 

<u>  =  •  (6-9) 

T  T‘ 

C,qa)  =  -  (6.10) 

Ru 

It  should  be  noted  that  the  propagation  delay  of  node  k  through  the  RC  tree  can  also  be 
approximated  by 


/ ,  +  t , 

(6  11) 

where  f ,  and  r  .  satisfy  the  equations  V'r  =  v.t  ( r , )  and  V-  =  vnl  (t, ).  Recently,  several  research¬ 
ers  [f>4-67]  have  applied  these  types  of  methods  in  estimating  the  signal  propagation  delay 
through  critical  paths  or  in  performing  the  node  reduction  of  interconnect  RC  trees  in  the  VI  SI 
chip. 


6.3.3.  Our  approach 

The  AL'TODELAY  approach  for  the  node  reduction  of  an  RC  tree  is  to  choose  A’ '  .  =  A  , 


and  C  (i )  to  satisfy 


finiU  )  —  Tl  ■  ((>  \  2) 

where  r,  ,  is  computed  from  the  P-R  bounds  using  Equation  (6.S).  The  motivation  here  is  that 

the  responses  in  the  original  and  reduced  networks  will  have  approximately  the  same  delav 

However,  there  is  still  a  degree  ol  freedom  in  choosing  individual  values  for  A  •,  and  C,y,. 

(only  their  product  is  required  to  be  a  constant).  The  AL'TODELAY  method  sets  R,yU  ,  to  A  , 

first,  then  computes  C‘yU  If  the  root  node  in  either  network  is  driven  by  an  ideal  voltage 

source,  then  this  poses  no  problem.  However,  as  in  the  case  ol  VI  SI  circuits,  if  the  root  node  is 

driven  by  a  nonideal  source  (such  as  an  inverter  element )  then  the  source  resistance  and  the 

capacitance  at  the  root  node  also  affect  the  choice  of  the  individual  values  ol  the  elements  in  the 

reduced  network. 

In  this  section  we  consider  a  new  technique  to  compute  the  values  of  the  individual  ele¬ 
ments  A,.yU  j  and  C^u  ,  in  the  reduced  network.  First  we  compute  the  Elmore's  time  constant 
T:n  between  the  root  and  node  k  in  the  original  network.  It  must  be  noted  that  this  computa¬ 
tion  merely  involves  simple  multiplication  and  addition  and  is  therefore  much  simpler  than 
computing  Tt  by  Equations  (6.7)  and  (6.8).  We  then  require  R‘yti  and  C'  , ,  to  satisfy 

^<-yU  )  =  Tr>L  .  t  6  1  3  ' 

The  motivation  lor  such  requirement  is  based  on  the  following  theorem 

Theorem  7.1 

Consider  an  approximation  of  the  waveform  v,  ( t  )  at  node  k  in  the  RC  tree  by  the  waveform 
A  -f/rm 

(t  )  ^  1  —e  )  for  a  step  input.  It  can  be  shown  that  tins  waveform  \  ,„.(f  ’■  tails 

between  P-R  bounds  [62, 68],  i.e..  vu  ( t  )  ^  vn,ti ,  (r  )  ^  v,(.  (t  ). 

Proof : 

It  is  noted  that  f  or  an  RC  tree.  7^  ^  Tnt  ^  Tf, .  there  are  three  cases  I  or  v:/(i  )— r  _  1 1  I 
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and  there  are  two  cases  for  v(1  (/  )—  v£/n|  i  (/  ) 


( i )  r  ^  rni  -r. 


In  the  next  section  we  will  discuss  the  problem  of  the  nonzero  source  resistance  of  the 


drr.  er  element  in  more  detail  and  derive  a  further  restriction  on  C,!  .  ,  w  hich  together  w  ith 


liquation  ( 6  13)  w  ill  yield  explicit  values  for  R‘,  a  >  and  C‘  u 


6.3.4.  Node  reduction  technique 


Since  Mmores  time  constant  is  a  good  single  time  constant  to  approximate  a  multiple 


time-constant  system,  in  our  approach  we  first  let  R'  tk  ,  and  C‘  4  ,  satisfy  Fquation  (6.13) 


I  he  choice  of  R,.,,  k  ,  and  C  (4  however,  depends  strongly  on  the  loading  effect  of  an  RC  tree 


to  its  driver,  and  is  not  arbitrary  It  /?,'  ,  is  simple  taken  as  Rm1  the  loading  effect  of  the  val¬ 


ued  C  .,4  ,  to  driver  is  somewhat  inaccurate  In  finding  an  appropriate  C"  .  , .  we  shift  the 


root  of  the  RC  tree  to  the  input  node  of  driver  and  include  another  constraint,  namelv . 


Mmores  time  constant  from  inrut  node  of  non-ideal  driver  to  output  node  Slavs  the  same 


before  and  after  the  node  reduction  1  et  R.  and  C  lie  on-resistance  and  output  capaci'ance  of 


t0  Jf 

the  driver,  respectively  If  T]n  and  T,h  are  Elmores  time  constants  computed  from  the  input 
node  of  the  driver  to  output  node  k  in  the  original  and  the  reduced  circuits,  respectively,  this 
constraint  can  be  written  as  follows: 


where 
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and 
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Dl 


=  R,  Ic, 
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(6  16) 


f  rom  the  above  constraint  and  Equation  (6.13).  the  values  of  R^a  ,  and  C^u  )  can  be  given  by 
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C,,u  »  IC,  . 
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(6  17) 


(6.18) 


As  an  example,  the  following  equality  must  hold  for  the  circuit  shown  in  Figure  6  1 


R, 


jCs  +  C,  +  c,  +  c3  +  c4  +  cs  +  cb  +  c7 


+  ^,7)^(7, 


|c,  "f  C„l7,  +  /?,.y  ,  7 ,Crv  (  7 ) 


(6.19) 


v.  here  /?,  and  C\  are  the  on-resistance  and  the  output  capacitance  of  the  driver,  respectively. 
On  simplification,  it  is  clear  that  Cr^l7,  has  to  be  equal  to  the  sum  of  all  capacitances  in  the  RC 
tree  Then  from  liquation  (6  13)  the  value  of  R‘  t 7)  can  be  easily  determined.  In  this  particu¬ 
lar  example.  ,,  =  C,+C2+C  ,+C4+Ci+Cf)+C7  and  Rty  =  7’n7/cfyl  7) .  It  is  noticed  that  in  the 
constraint  formulation,  although  the  on-resistance  and  the  output  capacitance  Cs  enter  the 


expression,  they  are  cancelled  out  in  the  final  formulas.  1'herefore.  this  technique  is  independent 
of  the  on  resistance  and  the  output  capacitance  ol  the  driver 


For  the  purpose  of  comparison,  a  modified  AL'TODELAY  method  which  utilizes  the  fol¬ 
lowing  equations  to  find  Rl  (i  >  and  C,^u  )  is  also  studied. 

<u  ,=Ru  ■  (6-20) 

T 

)  =  (6.21) 

<u  , 

This  method  is  similar  to  the  AL'TODELAY'  method  except  that  the  equivalent  time  constant  is 
Tni  instead  of  t,  calculated  by  Equation  (6.8)  via  P-R  bounds.  The  example  circuit  used  in 
this  study  along  with  the  simulation  output  waveforms  are  depicted  in  Figure  6.2.  The  percen¬ 
tage  error  in  the  propagation  delay  is  -42.3  for  the  modified  AL'TODELAY'  method,  while  the 
percentage  error  is  only  -3.8  by  using  our  method.  These  numbers  indicate  that  our  technique 
is  substantially  better  than  the  above  mentioned  technique  in  terms  of  accuracy.  This  is  partic¬ 
ularly  true  when  the  on-resistance  of  driver  is  very  high.  In  such  case,  the  loading  efTect  of 
capacitance  to  driver  becomes  more  pronounced  and  cannot  be  neglected. 

6.4.  Node  Reduction  for  Circuit  Simulation 

6.4.1.  Problem  formulation 

If  we  want  to  use  the  node-reduction  technique  to  reduce  RC  networks  for  circuit  simula¬ 
tion.  then  the  problem  here  is  different  from  the  problem  previously  described.  The  node- 
reduction  problem  for  circuit  simulation  can  be  described  as  follows,  l  et  N  denote  a  driver 
plus  an  RC  tree  network  with  the  driver  input  node  and  a  set  O  of  output  nodes.  We  wish  to 
replace  the  above  tree  network  with  a  network  consisting  of  the  same  driver  plus  a  simple 
RC  network,  in  which  only  the  root  node  and  the  set  of  nodes  in  O  are  involved,  such  that  for 
each  node  k  €  O  there  is  a  resistance  ^  (1  ,  between  the  root  and  node  k  and  a  capacitance 
C”  _  between  the  node  k  and  the  ground  node.  The  criterion  for  choosing  the  element  values 
in  N .  is  that  the  response  at  node  i  €  O  due  to  a  unit  step  input  at  the  input  of  the  driver  in 
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Figure  6.3  A  reduced  circuit  for  circuit  simulation. 


6.4.2.  Heuristic  algorithm  for  node  reduction  in  extracted  circuit  output 

In  contrast  to  finding  just  one  pair  of  R^u  ,  and  C^u  ,  in  the  node  reduction  for  timing 
estimation,  our  goal  here  is  to  find  every  pair  of  R  H  ,  and  C,vU ,  for  each  k  €  O  which  accu¬ 
rately  approximates  the  delays  from  the  input  node  to  output  nodes,  regardless  of  electrical 
parameters  associated  with  the  driver.  However,  the  method  we  develop  is  stiil  based  on  the 
concept  of  node  reduction  for  timing  estimation.  Two  constraints  on  the  values  of  R,qKL)  and 
C  .  after  node  reduction  are  enforced: 

•  T*  <  A  ' 

( 1  )  Flmore's  time  constants  computed  from  the  root  are  equal  in  the  original  and  reduced  net¬ 
works.  namely. 

rDi  =  rf>t  for k  eo  (6.22) 

It  should  be  noted  that  Flmore's  time  constant  after  node  reduction  at  node  k  is  equal  to 

RrjU  >■ 

(2)  Flmore’s  time  constants  computed  from  the  input  of  the  driver  are  equal  in  the  original 
and  reduced  networks,  namely. 


a»t;i 


where 


i  hL  =  T:h  for  a:  €  O 


Tni  =  Q  c,  +  ic,  +rz 


(6  23) 


(6  24) 


Tm  =  *,  CS+£C  U1  +7*. 


(6.25) 


Consider  an  AC  tree  with  m  output  nodes.  From  the  first  constraint,  m  constraint  equations 
can  be  derived.  Hul  only  one  constraint  equation  will  be  obtained  from  the  second  constraint 
alter  cancellation,  namelv 


IQ,u>  =  IQ 


(6.26) 


Therefore,  in  this  case  we  have  to  solve  2m  variables  from  m  +  1  constraint  equations.  Except 
for  m  =1,  the  number  of  variables  is  less  than  the  number  of  equations.  In  order  to  compute  the 
exact  values  for  AQ,.  )  and  C  u  a  heuristic  method  based  on  Equation  (6.26)  is  employed  to 
calculate  m  capacitances.  Equation  (6.22)  is  then  used  to  calculate  the  rest  of  m  resistances. 


The  heuristic  node-reduction  algorithm  can  be  described  as  follows: 


Node  Reduction  Algorithm 


1  For  each  k  €  O  determine  Elmore's  time  constant  Tni  between  root  node  0  and  node  k 

2.  For  each  output  node  k  €  O.  calculate  the  equivalent  A.  ,  .  and  C.  ,,  .  as  follows: 


where  CT  =  I  ct  is  the  total  capacitance  in  the  RC  tree. 

1=1 

3.  For  each  k  €  O.  Tm  can  be  used  to  estimate  the  propagation  delay.  Therefore,  given  a 
delay  threshold,  we  can  determine  whether  lumping  the  equivalent  resistance  £  u  .  is 
really  necessary.  We  report  (1 ,  only  when  the  value  of  TDl  is  greater  than  a  certain 
delay  threshold.  Otherwise,  we  discard  the  element  RlyU  )  and  just  merge  the  two  nodes. 

It  must  be  noted  that  fan-out  gate  capacitances  need  to  be  added  in  the  interconnect  RC 
trees  when  applying  this  algorithm  and  should  be  subtracted  from  the  output  node  capaci¬ 
tances.  Therefore,  the  main  consideration  taken  in  step  2  is  to  make  sure  that  the  value  of  the 
output  capacitance  is  greater  than  zero  after  substracting  fan-out  gate  capacitances.  The  sim¬ 
plest  distribution  of  C;  which  meets  the  above  requirement  is  shown  in  step  2.  namely,  using 
the  output  node  capacitance  as  a  weight  to  distribute  Cr. 

As  an  example.  Figure  6.4  shows  a  circuit  where  a  driver  has  an  RC  tree  as  a  load.  After 
applying  the  node-reduction  algorithm,  the  reduced  circuit  is  shown  in  Figure  6.5.  Assume  that 
the  on-resistance  and  output  capacitance  of  the  driver  are  500  Q  and  0.2  pf.  respectively.  The 
output  waveforms  of  node  21  before  and  after  using  the  node-reduction  technique  for  a  1.66 
V /ns  ramp  input  are  shown  in  Figure  6.6.  The  propagation  delays  and  rise  times  for  all  output 
nodes  are  also  listed  in  Table  6.1.  Comparisons  of  output  responses  show’  reasonable  agreement 
between  the  two  outputs.  This  is  because  in  the  node-reduction  process  we  have  already  con¬ 
sidered  the  loading  effect  of  each  output  node.  Some  errors  produced  are  due  to  the  use  of  a 
smgle-time-constant  system  approximating  a  mulliple-lime-constant  system.  Also  the  increase 
of  R (  and  C,  decreases  the  errors  observed  in  the  output  waveforms.  The  reason  for  this  is  that 
A’,  and  C,  play  dominant  roles  in  determining  the  output  waveforms  when  the  values  of  Rs 


and  C.  become  larger. 


figure  6.5  A  reduced  circuit  corresponding  to  the  circuit 
shown  in  Figure  6.4. 


Before  node  reduction 
After  node  reduction 
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re  6.6  Output  waveforms  at  node  21  before 
and  after  node  reduction. 


Table  6.1  Propagation  and  rise-time  delays 
of  the  circuit  shown  in  Fig.  6.4. 


Node  Number 


Before  node  reduction 


T,,.-  (ns) 

Tpn  (ns) 

Frror  (9c) 

10.09 

2.88 

2.0 

10.08 

2.90 

1.4 

After  node  reduction 


T...,  (ns) 


10.04 


10.04 


!  10.04 


10.04 


3.50 

10.21 

3.45 

3.62 

10.22 

3.57 

Frror  (9< ) 


0.5 


0.4 


0.4 


0.4 


3.3 


6.4.3.  Implementation 

In  applying  the  above  algorithm  to  circuit  extraction,  first  of  all.  the  circuit  extractor 
should  be  able  to  model  interconnects  as  RC  networks.  In  our  circuit  extractor  we  have 
employed  a  l wo-segment.  7r-lumped  RC  circuit  model  to  approximate  the  distributed  behavior 
of  all  branches  in  the  interconnection  region.  Fverv  net  is  then  tested  to  see  whether  further 
node  reduction  is  possible,  depending  on  the  net  type  and  connection  information.  Generally,  if 
the  net  is  connected  to  the  pull-up  node  and  no  pass  transistors  are  connected  to  it.  this  net  can 
be  passed  to  the  node  reduction  module  for  further  processing.  In  the  node-reduction  module, 
the  driver  node  associated  with  each  net  is  first  identified  as  a  root.  As  for  the  RC  tree  synthesis 
algorithm,  the  dynamic  tree  node  allocation  technique  is  utilized  in  order  to  reduce  memory 
overhead  and  make  the  RC  tree  construction  more  flexible.  Also  note  that  during  the  tree  con¬ 
struction  step  fan-out  gale  capacitances  have  to  be  estimated  and  added  to  the  associated  nodes 
of  the  RC  tree  in  order  to  correctly  take  output  capacitive  loadings  into  account.  After  the  RC 
tree  corresponding  to  this  root  has  been  constructed,  the  dept  h-first  search  algorithm  is  used  lo 
find  Flmore  s  lime  constant  of  everv  output  node.  The  algorithm  we  use  is  .-similar  to  the 
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FREE  algorithm  presented  in  [62]  Finally,  this  information  is  used  to  produce  simple  lumped 
A’C  networks  b\  the  heuristic  algorithm  given  m  the  previous  section  Since  the  heuristic 
includes  the  filtering  process,  negligible  resistances  are  always  discarded  and  will  not  be 
reported  in  the  extraction  output. 

The  above  algorithm  is  implemented  in  our  extractor  as  follows: 

Input  transistor  netlist  M0  and  nets  N(). 

Output  transistor  netlist  Mf.  and  nets  N#  w'ith 
reduced  RC  networks. 

PROCEDURE  RFDl  CETRFEX  A/,,  .A,, ): 

begin 

1  From  A/()  find  a  set  of  pull-up  nodes  F . 

2  for  (each  node  i  €  P  )  do 
begin 

3  Fetch  the  associated  net  \\  €  N(,  of  nodei. 

4  Construct  the  RC  tree  T  from  net  ,V(  and 

set  the  root  of  tree  to  i: 

5  if  (no  pass  transistors  connected  to  F  )  then 
begin 

6  Add  fan-out  node  capacitances  to  the 

corresponding  output  nodes  in  Tl : 

7  Perform  node  reduction  and  filtering  process: 

(Node  reduction  algorithm) 

S  Update  node  number  for  transistors: 

9  Subtract  fan-out  capacitances  from  the 
corresponding  output  nodes; 

end; 

end: 

10.  return!  A/a  and  \f. ); 
end: 


6.5.  Examples 

I  fie  first  example  \ce  will  illustrate  is  an  WIOS  one-bit  full  adder.  Circuit  extractions 
vere  pert  rmed  with  and  without  applying  the  node  reduction,  and  followed  bv  SPICE?  1  3] 
^  r.uit  simulations,  f  igure  6  7  shows  the  output  waveforms  of  the  sum  bit  before  a"  ter 
r-  rt  rmmg  ihe  node-reduction  technique.  This  simulation  indicates  that  both  waveforms  agree 
■  er .  xeli  lloccecer  simulation  time  lor  i  he  circuit  with  node  reduction  is  23t7r  less  than  that 


Before  node  reduction 
After  node  reduction 


As  another  example,  the  layout  of  a  CMOS  PLA  consisting  of  44  transistors  was  extracted 
with  and  w  ithout  the  node  reduction  The  circuit  output  consisted  of  SI  resistors  and  99  capa¬ 
citors  without  any  node  reduction,  while  it  consists  of  56  resistors  and  74  capacitors  after  node 
reduction.  Simulation  results  also  show  good  agreement  between  the  two  waveforms  In  this 
particular  example,  the  simulation  time  is  reduced  by  107c  with  the  node  reduction.  Finally,  a 
4-bit  !\M()S  full  adder  consisting  of  82  transistors  was  composed  from  the  1  -bit  full  adder  pre¬ 
viously  considered,  and  was  extracted.  The  extraction  output  contains  154  resistors  and  190 
capacitors  before  the  node  reduction  is  performed.  However,  after  the  node  reduction  only  70 
resistors  and  106  capacitors  are  left  in  the  output.  A  substantial  number  of  resistors  and  capa¬ 
citors  is  reduced  or  filtered  out  in  this  case.  The  results  from  circuit  simulations  still  confirm 
the  accuracy  ol  our  proposed  node-reduction  technique. 

6.6.  Conclusions 

A  simple  node-reduction  technique  for  RC  tree  networks  is  presented  in  this  chapter  This 
method  computes  the  values  of  the  resistances  and  capacitances  in  the  reduced  network  using 
the  concept  of  Flmore  s  time  constant.  Circuit  simulations  show  that  application  of  this  tech¬ 
nique  reduces  complicated  RC  trees  into  simple  lumped  RC  networks  without  significant  loss 
in  accuracy  Therefore,  this  technique  is  useful  in  circuit  extraction.  It  can  be  used  to  decrease 
the  number  of  nodes  in  RC  tree  networks  for  modeling  the  interconnect  regions  in  circuit  lay¬ 
outs.  thereby  reducing  the  circuit  simulation  time  in  the  next  step  of  layout  verification.  A 
heuristic  algorithm  based  on  this  technique  has  been  implemented  in  our  experimental  circuit 
extractor  Simulation  results  show  a  good  agreement  in  timing  for  RC  networks  obtained 
betore  and  after  applying  this  technique.  Another  potential  application  of  this  node-reduction 
technique  is  mentioned  in  [64-65],  wherein  every  RC  tree  is  simplified  to  a  single  RC  circuit, 
.ind  'hen  delav  equations  are  used  to  estimate  timing  through  a  user-specified  critical  path. 


6.7.  Remarks 


The  node-reduction  technique  can  easily  be  adapted  to  RC  mesh  networks  in  which  some 
resistance  loops  exist.  In  this  more  general  case,  the  definition  of  i.lmore  s  time  constant  men¬ 
tioned  in  the  first  section  is  still  valid.  However,  the  value  of  R  can  no  longer  be  determined 
by  inspection  of  the  topology  of  mesh  networks.  Two  methods  can  be  used  to  calculate 
Hlmores  time  constant  in  a  general  RC  circuit  The  first  method  follows  the  definition  of 
Flmore's  time  constant  by  finding  the  conductance  matrix  first  The  conductance  matrix  for  the 
RC  mesh  networks  can  be  easily  obtained  by  the  stamp  method  proposed  in  the  modified  nodal 
analysis  [69],  which  is  widely  used  in  circuit  analysis  programs.  Then  the  resistance  matrix  is 
found  by  inversion  of  the  conductance  matrix,  and  R,  t  is  simply  the  ki'h  entry  of  this  resis¬ 
tance  matrix  [70].  Since  this  method  involves  matrix  inversion,  it  may  be  very  time- 
consuming.  The  second  method,  instead  of  finding  Rtl,  is  to  compute  Hlmore’s  time  constants 
directly  by  the  iterative  scheme  as  mentioned  in  [62].  This  method  employs  the  concept  of  tree 
decomposition  and  load  redistribution  and  solves  a  system  of  linear  equations.  However,  no 
matter  which  method  is  chosen,  once  the  value  of  Flmore's  time  constant  is  known,  the  applica¬ 
tion  of  our  technique  is  fairly  straightforward 


CHAPTER  7. 


CONCLUSIONS 


Layout  verification,  which  checks  if  the  design  works  as  originally  intended,  is  a  crucial 
step  in  the  final  stage  of  a  VLSI  chip  design.  Layout  design  verification  typically  consists  of 
three  functions:  (1)  design-rule  check  (DRC).  (2)  functional  verification,  and  (3)  performance 
verification.  These  functions  are  explained  below. 

Design  rules  are  specified  in  terms  of  a  set  of  geometric  constraints,  which  are  the  physical 
limitations  imposed  by  the  fabrication  process.  For  example,  a  metal  line  cannot  be  narrower 
than  a  certain  width;  otherwise,  it  will  be  subjected  to  an  eleclromigration  reliability  problem 
or  possibly  a  break-up  due  to  misalignment.  Design  rule  checks  involve  intensive  geometric 
manipulations  V  iolations  in  design  rules  will  decrease  the  chip  yield.  Therefore,  all  critical 
design  rules  have  to  be  checked  before  the  design  is  sent  to  be  fabricated. 

Functional  verification  checks  the  extracted  function  of  a  layout  with  the  intended  func¬ 
tion  Circuit  extraction  is  first  applied  to  obtain  the  transistor  nellist  from  the  layout.  This 
extracted  nellist  is  then  checked  for  connectivity  against  the  original  schematic  by  a  schematic 
comparison  program,  or  an  electrical  rule  checker  can  be  used  to  detect  electrical  rule  violations 
such  as  power-to-ground  shorts.  Logic  or  circuit  simulation  can  also  be  employed  to  check  the 
function  of  the  extracted  circuit  However,  it  is  input-dependent  and  cannot  pinpoint  error 
locations  correctly. 

Performance  verification  checks  the  timing  characteristics  against  the  design  specification 
Circuit  parameters  related  to  propagation  delays  must  be  carefully  modeled  and  extracted.  Lor 
example,  the  extractor  has  to  prov  ide  a  list  of  transistors  and  interconnect  parasilics.  and  their 
^  nnectiv  ities  ana  associated  parameters,  such  as  channel  wodths  and  lengths,  capacitance  and 


resistance  values.  This  extracted  information  is  then  fed  into  a  circuit  simulator  to  check  if  the 
design  satisfies  the  timing  specifications. 

Handcrafted  and  semiautomated  design  techniques  result  in  error-prone  layouts,  which 
require  all  three  types  of  layout  verification.  Recently,  there  is  a  trend  toward  the  use  of 
fully-automated  design  techniques  such  as  silicon  compilers  and  macrocell  assemblers.  These 
"correct-bv-conslruction"  tools  are  able  to  implement  circuit  layouts  from  a  high-level  descrip¬ 
tion  without  human  intervention.  Although  design  rules  and  the  layout  function  may  not  be 
checked,  performance  verification  is  still  a  necessity  to  guarantee  a  successful  design.  Thus  is 
especially  true  in  today's  VLSI  circuits  with  higher  complexity  and  even  smaller  feature  sizes. 
Propagation  delays  due  to  interconnects  in  a  VLSI  chip  become  almost  comparable  to  gate 
delays.  Hence,  in  order  to  predict  the  timing  effect  caused  by  interconnect  parasitics.  these 
parasitics  must  be  directly  modeled  from  actual  physical  layouts. 

In  this  thesis,  performance  verification  of  VLSI  circuits  related  to  critical  interconnect 
parasitics  is  tackled  from  several  respects.  First,  two-dimensional  numerical  models  for  inter¬ 
connects  are  studied,  and  are  used  to  produce  the  reference  values  for  deriving  closed-form  for¬ 
mulas.  Second,  in  order  to  automate  the  parasitic  identification  process,  a  circuit  extractor  that 
takes  a  circuit  layout  as  input  and  computes  detailed  interconnect  parasitics  by  using  the  fitted 
formulas  has  been  built.  This  step  involves  some  critical  geometric  manipulations.  Finally,  in 
order  to  predict  the  parasitic  effect  on  the  timing  performance  of  a  VLSI  system,  a  node¬ 
collapsing  technique  which  retains  the  delay  effect  of  interconnects  has  been  studied. 

The  first  part  of  this  thesis  deals  with  two-dimensional  numerical  models  for  intercon¬ 
nects.  Based  on  these  models,  a  finite-element  program  FFMRC  has  been  developed  and  imple¬ 
mented  on  a  SL  \  workstation.  This  program  requires  no  geometric  limitations  in  calculating 
capacitances  and  resistances  due  to  the  nature  of  the  finite-element  method.  In  resistance  calcu¬ 
lation.  contact  resistance  is  caref  ully  modeled  bv  taking  a  quasi-three-dimensional  current  flow 


into  account  Furthermore,  only  a  small  amount  of  input  data  has  to  be  provided  by  the  users 
because  of  the  use  of  an  interactive  grid  generator  KJ(iI2  This  feature  substantially  increases 
the  usefulness  of  FEMRC. 

In  the  second  part  of  this  thesis,  we  concentrate  on  developing  a  hierarchical  parasitic  and 
circuit  extractor  HPEX  to  deal  with  the  increased  complexity  and  small  feature  sizes  of  VLSI 
circuits  Storing  layout  data  hierarchically.  HPEX  is  able  to  handle  a  large  volume  of  data  as 
compared  to  flat  extraction  if  the  regularity  factor  is  very  high.  HPEX  also  performs  extraction 
hierarchically,  thereby  saving  CPU  time  significantly  by  not  extracting  repeated  cells.  Main 
features  of  this  program  are  summarized  as  follows: 

<  1  )  It  is  able  to  model  interconnection  lines  as  distributed  lumped  circuits  directly  from  a  cir¬ 
cuit  layout  and  automatically  renumbers  the  generated  circuit 

(2)  In  order  to  reduce  computation  time,  analytical  formulas  instead  of  numerical  methods 
are  employed  to  compute  interconnect  resistances  and  capacitances  which  include  coupling 
capacitances  between  interconnection  lines 

(3)  The  difference  in  feature  sizes  between  mask  layouts  and  actual  fabricated  conductors  is 
taken  into  account  by  a  geometry  preprocessing  step  in  which  a  novel  scanline  method  is 
used.  The  accuracy  of  the  interconnect  model  increases  if  changes  in  feature  sizes  after 
lubrication  are  taken  into  consideration. 

(4)  In  cell  extraction,  a  new  accurate  node  reduction  technique  is  employed  to  facilitate  design 
\  erificalion 

( ,s  1  The  output  formal  is  exactly  the  SPICE  input 

These  features  demonstrate  that  the  use  of  HPEX  along  with  other  simulation  tools  in  layout 
verification  are  able  to  guarantee  the  successf  ulness  of  the  first  turnaround  in  the  design  cycle 
Since  the  fabrication  process,  which  translates  a  layout  design  into  a  piece  of  silicon,  is  very 


costly  and  time-consuming,  the  production  cost  can  be  significantly  reduced  it  we  are  able  to 
guarantee  the  correctness  of  a  VLSI  design  at  the  very  beginning. 

In  this  thesis,  the  signal  delay  affected  by  interconnects  for  layout  verification  is  primarily 
emphasized.  Hcwever.  there  are  still  other  factors  which  affect  the  reliability  or  the  func¬ 
tionality  of  a  circuit  operation.  The  issues  of  eleclromigration  and  power  or  ground  voltage 
drop  have  recently  attracted  several  researchers  attention  [71-73]  due  to  their  important  roles 
in  the  chip  operation  for  small  feature  sizes.  In  VLSI  chips  power  is  distributed  through  metal 
lines.  Current  flow  leads  to  the  problems  of  metal  migration  and  excessive  voltage  drop.  Metal 
migration  may  produce  a  break  in  a  metal  line,  causing  a  reliability  problem.  Excessive  voltage 
drops  may  result  in  incorrect  logic  operation  and  degradation  in  the  chip  speed  Metal  migration 
is  a  wear-out  phenomenon  in  which  the  median  time  to  failure  (MTTP)  of  a  rpetal  line  is 
inversely  proportional  to  some  power  of  current  density.  In  a  typical  design,  MTTF  should  pass 
the  expected  life  of  the  system.  Since  the  current  density  estimation  in  a  metal  line  strongly 
depends  on  the  physical  layout  and  the  circuit  operation,  estimations  given  bv  designers  are 
tedious  and  difficult.  Therefore,  the  need  for  CAD  tools  that  are  able  to  automate  the  estimation 
of  current  density  in  each  branch  of  the  power  network  is  becoming  increasingly  apparent. 

Some  possible  extensions  of  FEMRC  and  HPEX  may  cope  with  this  problem.  First.  HPEX 
is  enhanced  by  incorporating  FEMRC  to  calculate  resistances  of  irregularly  shaped  conductor 
regions.  Then  an  RC  network  is  extracted  from  the  layout  of  pow'er  distribution.  HPEX  is  also 
used  to  extract  each  individual  cell.  SPICE  simulation  of  these  cells  can  then  be  used  to  estimate 
the  current  loading  to  the  extracted  power  RC  network.  Although  the  current  drawn  by  cells  is 
transient  in  time  and  input-dependent,  it  can  be  transformed  statistically  into  continuous  dc 
electromigration  equivalent  values."  After  the  current  loading  set  is  known,  by  SPICE  simula¬ 
tion  the  current  in  each  metal  branch  and  the  node  voltages  are  easily  computed.  If  the  metal 
branch  is  rectangular,  the  estimation  of  current  densitv  is  obvious.  11  the  metal  branch  is  irreg¬ 
ular.  some  heuristics  or  FEMRC  using  nodal  voltages  as  he  boundary  condition  can  lx*  applied 
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to  estimate  the  maximum  current  density  inside  this  region.  The  current  density  is  then 
checked  against  the  specified  value  to  see  if  the  electromigration  rule  is  violated.  Voltage  drops 
in  power  lines  are  also  estimated  at  this  stage.  If  there  are  some  violations,  a  technique  similar 
to  SPIDER  [72]  is  recommended  to  adjust  the  width  of  conductors. 


Another  possible  extension  of  HPE\  is  to  extract  the  symbolic  layout  from  the  physical 
layout.  This  symbolic  layout  extraction  is  also  known  as  a  circuit  disassembler  [74],  which  not 
only  extracts  devices  and  nets,  but  also  determines  the  relative  positions  of  symbolic  circuit 
elements.  This  type  of  extraction  facilitates  physical  layout  conversion  when  design  rules  are 
changed  by  only  compacting  the  symbolic  layout.  Therefore,  most  of  the  previously  designed 
layouts  can  be  reused  even  when  we  have  different  design  rules. 
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APPENDIX  A. 

HPEX  USER’S  MANUAL 

This  user's  manual  describes  how  to  use  HPEX,  a  semi-interactive  hierarchical  circuit 
extractor.  HPEX  reads  an  intergrated  circuit  layout  description  in  Caltech  Intermediate  Form 
(CIF  2.0)  and  creates  five  different  circuit  descriptions  as  described  in  Chapter  1.  Comparison 
or  simulation  can  then  be  driven  from  each  different  extracted  circuit  to  insure  the  correctness 
of  the  design. 

If  schematics  are  used  to  produce  a  layout,  we  have  to  certify  that  the  actual  layout  is 
what  it  is  intended  for.  One  possible  technique  is  to  use  simulation  as  a  tool  for  verification. 
However,  this  becomes  prohibitive  as  the  size  and  complexity  of  the  circuits  grow.  A  nellist 
comparison  program,  such  as  GEMINI,  is  therefore  adopted  to  verify  the  connectivity  of  the 
extracted  circuit.  Two  netlists  are  requi-ed  as  input  to  GEMINI.  The  first  one  is  a  transistor- 
level  netlist  from  the  schematics.  The  second  one  is  a  netlist  extracted  from  the  layout  by 
HPEX  Netlist  comparison  is  extremely  useful  in  locating  bugs  in  terms  of  connectivity. 

If  timing  information  is  needed  in  verifying  the  timing  characteristics  of  the  circuit,  the 
extracted  circuit  output  has  to  be  simulated  by  a  timing  simulator  or  a  circuit  simulator.  Tim¬ 
ing  simulators  such  as  MOSTIM  can  handle  a  large  circuit  by  simplifying  the  timing  model. 
WHle  circuit  simulators  such  as  SPICE  can  only  simulate  a  few  thousand  devices  due  to  com¬ 
plicated  device  models.  Typically,  only  the  critical  path  of  the  entire  chip  is  simulated  by  a  cir¬ 
cuit  simulator. 

In  this  user's  manual,  the  input  format  for  HPEX  including  some  user  extensions  of  CIE  is 
first  described.  Then  parameters  in  the  process  file  associated  with  HPEX  are  listed  and 
explained.  Finally,  bv  a  layout  example,  the  execution  of  HPEX  is  illustrated. 
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A.l.  Input  Format 

The  input  to  HPHX  is  described  in  a  subset  of  CTI F .  namely.  Manhattan  geometries  Details 
of  CIF  syntax  and  semantics  can  be  found  in  [22]  For  the  purpose  of  labeling  the  signal  names 
of  some  particular  nodes  and  the  terminal  names  of  a  cell  description,  several  user-extension 
commands  on  CIF  have  been  developed.  The  CIF  extension  commands  95.  96.  90.  91.  and  99 
are  used  to  name  nodes  The  forms  of  these  CIF  commands  are  as  follows: 

95  name  x  v  layer: 

96  name  x  v  layer: 

90  name  x  v  layer. 

91  name  x  v  layer: 

99  name  x  y  layer, 
where  95  input  node. 

96  .  output  node. 

90  :  ground  node. 

91  :  power  node. 

99  :  terminal  node. 

and  name  :  character  name  for  signal. 

These  commands  attach  the  name  to  the  mask  geometry  on  the  specified  layer  crossing  the  point 
(x.  y ).  These  names  may  contain  any'  ASCII  character  except  space,  tab.  newline,  double 
quotes,  parenthesis,  and  semicolon.  It  should  be  noted  that  the  point  of  a  signal  name  must  be 
attached  to  a  corresponding  layer  described  in  terms  of  the  designated  integer  numbers.  The 
mapping  between  layers  and  internal  integer  numbers  has  been  described  in  Chapter  4  In  the 
terminal  labeling,  the  attached  point  must  be  on  the  boundary  of  the  corresponding  cell 
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A. 2.  Process  File 

In  order  to  run  IIPEX.  a  process  file  "cmosprocess.h"  has  to  be  included  in  the  current 
directory.  All  the  necessary  process  parameters,  which  determine  the  values  of  interconnect 
parasitics,  are  contained  in  this  file.  By  changing  the  parameters,  users  can  easily  experiment 
with  different  processes.  The  parameters  listed  in  the  file  "cmosprocess.h"  are  summarized  as 

follows: 

( ’  Process  Parameters  *) 

RFSdiff:  diffusion  sheet  resistance 
RFSpoly:  polysilicon  sheet  resistance 
RPSmetall:  metal  1  sheet  resistance 
RFSmetal2:  metal  2  sheet  resistance 
SCRFSditl:  diffusion  specific  contact  resistivity 
SCRFSpolv:  polysilicon  specific  contact  resistivity 
SCRFSmetal:  metal  specific  contact  resistivity 
THICkpoly:  thickness  of  polvsilicon 
THICkmetall:  thickness  of  metal  1 
THICkmeial2:  thickness  of  metal  2 
THlCkfield:  thickness  of  the  field  oxide 

TMICkoxl:  thickness  of  the  dielectric  between  metal  1  and  diff  or  poly 

THICkox2:  thickness  of  the  dielectric  between  metal  1  and  metal  2 

THICkpass:  thickness  of  passivation  layer 

FPSoxl:  permittiv  ity  ol  dielectric  1 

FPSo.\2:  permittivity  of  dielectric  2 

1  PSpass:  permittivity  of  the  passivation  dielectric 

I  PSsi:  permittivity  of  silicon 

\I  intrinsic  carrier  concentration  of  the  semiconductor  1.45*10**10  'cm'*3  at  300k 
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KEQX'.  averaging  factor  for  junction  capacitance  calculation  for  NMOS  devices 

KEQP:  averaging  factor  for  junction  capacitance  calculation  for  PMOS  devices 

XDN:  drain,  source  X  doping  for  NMOS  devices 

\DP:  drain,  source  P  doping  for  PMOS  devices 

XSL’B:  n-type  substrate  doping 

NPWELI.:  p-well  doping 

TO\:  oxide  thickness 

\JP:  metallurgical  junction  depth  for  P  diffusion 

XJX:  metallurgical  junction  depth  for  N  diffusion 

LDP:  P  type  lateral  diffusion 

L.DN:  X  type  lateral  diffusion 

VT:  thermal  voltage 

Q:  electronic  charge 

C’JOP  zero-bias  bulk  junction  bottom  cap.  per  unit  area  for  P  diff 

CJOX:  zero-bias  bulk  junction  bottom  cap.  per  unit  area  for  X  diff 

COX:  gate  oxide  cap.  per  unit  area 

PBP:  built-in  junction  potential  for  P  diff 

PBX:  built-in  junction  potential  for  X  diff 

CCiSOP:  gate-source  overlap  cap.  per  meter  channel  width  for  PMOS  devices 

C'CiSOX:  gate-source  overlap  cap.  per  meter  channel  width  for  XMOS  devices 

Rth:  lower  threshold  for  resistance 

Rthseg:  higher  threshold  for  resistance 

Cselfth:  threshold  lor  self-capacitance 

Ccoupth:  threshold  for  coupling  capacitance 

XETCiAP:  threshold  lor  the  net  distance  -  used  in  coupling  capacitance  calculation 
HR  AXC'IKiAP:  threshold  for  the  branch  distance  -  used  in  coupling  capacitance  calculation 


v:- 
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HoriScale:  scaling  factor  for  the  layout  -  only  horizontal 
DFI.AYTH:  threshold  for  the  delay  used  in  the  node  reduction 

GLITCHTH:  threshold  for  voltage  glitch  -  for  filtering  negligible  effect  of  coupling  capacitance 
Cnj:  capacitance  per  unit  area  for  Y  diff  junctions 
C'njp:  capacitance  per  unit  length  for  Y  diff  junctions 
( 'pj:  capacitance  per  unit  area  for  P  diff  junctions' 

Cpjp:  capacitance  per  unit  length  for  P  diff  junctions 
Cm2ml:  capacitance  per  unit  area  between  metal  1  and  metal  2 

Cm2p:  capacitance  per  unit  area  between  metal  2  and  poly 

Cmlp:  capacitance  per  unit  area  between  metal  1  and  poly 

Cm  Id:  capacitance  per  unit  area  between  metal  1  and  diffusion 

Cm2of:  capacitance  per  unit  area  between  metal  2  and  field  oxide 
Cmlof:  capacitance  per  unit  area  between  metal  1  and  field  oxide 
Cpof:  capacitance  per  unit  area  between  poly  and  field  oxide 
featuresized:  feature  size  threshold  for  diffusion 
fealuresizep:  feature  size  threshold  for  poly 
featuresizem  1 :  feature  size  threshold  for  metall 
featuresizem2:  feature  size  threshold  for  metal  2 
offset[l ..nmsklvls]:  offset  distance  for  each  mask  level 
(*  f'YD  OF  PROCFSS  PARAMETERS  *) 

A.3.  Running  HPEX 

To  run  HPFY  the  command  line  would  look  as  follows: 

hpex  [  -  [w]  [o]  [h]  [e]  ]  ciffile  [outfile]  [logfile] 


HPFX  gets  command  line  arguments  for  program  use.  The  four  fields  after  hpex  are  defined  as 
(options)  :  A  minus  sign,  followed  by  the  letter  w.  or  the  letter  >> .  or  the  ieller  h  or  the  letter 


V  s 


NTW’VA"  V  V 


y ».|  Si  ‘if  *•*  ».li«;. liii **dl 


'rr~w.m~w^  vi  v.v.^  "/w" 


’.MV 


f 


I 

* 
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e  .  or  any  combinalion  of  these 
-  w 

turn  oil  all  warnings  The  delault  is  to  send  warnings  to  the  log  file 

-o 

cause  tree  optimization  to  be  performed  If  omitted  the  default  is  not  to  per 
form  tree  optimization, 
h 

turn  on  a  hierarchical  extraction.  The  default  is  to  perform  a  flat  extraction 
-e 

cause  nonblank  characters  following  the  CIT  Fnd  command  to  generate  a  warn¬ 
ing  only.  The  default  is  to  generate  an  error. 

'ciffilel  The  name  of  the  CII;  file  to  be  used  as  input  to  the  extraction  program 
loutfilel  The  name  cf  the  output  file  to  which  the  extracted  information  is  to  be  written  If 
this  field  is  omitted,  the  default  is  to  send  the  listing  information  to  the  file 
ciffile.out. 

'logfilel  The  name  of  the  log  file  to  which  the  listing  information  is  to  be  written.  If  this  field 
is  omitted,  the  default  is  to  send  the  listing  information  to  the  file  'ciffile.log.' 


A.4.  Example 


The  CIF  file  "inv.eif"  of  a  CMOS  inverter  shown  in  Figure  A  1  is  described  as  follows: 


I)S  10  1  2: 

9  inv  1 ; 

I.  CW; 

13  4 #00  90(X)  -3600  -3900. 
I  CM: 

13  4SOO  1500  -3600  6450: 
13  1SOO  24(H)  -3600  2700; 
13  2100  15(H)  -3450  750; 


B  1800  2400  -3600  -1200 
B  1800  1800  -3600  -4800 
B  4800  1500  -3600  -6450 
L  CP: 

B  3600  600  -3900  4800: 

B  900  3300  -5250  2850: 

B  1200  900  -5400  750: 

B  900  3000  -5250  -1200: 
B  1500  300  -3150  1350: 

B  2700  900  -2550  750: 

B  1500  300  -3150  150: 

B  3600  900  -3900  -3150: 
1.  CD: 

B  1800  4800  -36(30  48(H): 
B  1800  6300  -36(H)  -4050 
L  CC: 

B  600  600  -3600  6450: 

B  6(H)  600  -36(H)  3150: 

B  600  600  -36(H)  -1650: 

B  600  600  -36(H)  -4950: 

B  600  600  -3600  -6450: 


y.s 
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B  600  600  31 50  750; 

1  US 

B  3000  6000  3600  4H00. 

B  3oo()  2100  3600  '6750. 

99  lgnd  *6000  -66(H)  5. 

99  rgnd  12(H)  -6600  5 
99  inp  6(K)0  600  3. 

99  out  1200  6(H)  3; 

99  lvdd  -6000  6300  5; 

99  rvdd  -1200  63(H)  5 
[)I 

C  10  '1'  0.0; 

1  rid 

A  typical  run  for  HPFX  is  to  type  the  following  command  line: 
hpe.x  -h  inv.cif  inv  out  . 

Following  this,  several  interactive  questions  have  to  be  answered  by  the  user  to  complete  the 
input  session.  In  this  example,  the  lollowmg  questions  appear  on  the  terminal  screen  sequen¬ 
tially. 

Which  file  do  you  like  to  generate-1 
(  1  )  MONTI M  input  file 
( 2 )  SPICF  input  file 
OKiFMIM  inpuifile 
(4)  l  ogic  block  description 
Enter  choice  2 

Do  you  like  to  use  measured  parameters  in  capacitance  computation-1  [n]  CR 
Do  y  ou  like  to  calculate  coupling  capacitances7  [n]  y  CR 
Do  s  ou  like  to  perform  node  reduction-1  [n]  CR 
Do  vou  like  to  generate  SPICF  model  cards7  [n]  CR 

It  vhould  be  noted  that  the  answer  "no"  is  defaulted  to  each  question  when  we  hit  the  carriage 
return  After  the  execution  is  completed,  the  output  file  "inv. out"  looks  as  follows: 


*  INPUT  Fil  l  inv  cil 

*  EXTRACTION  DAT!  10  Aug  S7  TIME  15:33:31 


I  . JLfc  AV  2 -A  Ait  /Jfc  *-* 


w’/VV 
■  .  *  •  *  *  *  «  •  '*, 


J*  . 
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*  INPUT  DECK  FOR  SPICE 

— 

.subckl 

inv  1 

13  12  16  17  9  22 

*rgnd  : 

13 

*]gnd  : 

12 

,  # 

*rvdd  : 

16 

*lvdd  : 

17 

« 

'out  :  9 

I 

*inp  :  22 

b  ' 
*  « 

'total  no.  of  transistors  :  2 

*  < 

*  NET 5 

rOOOl 

20 

22  2.500e+01 

r()()02 

20 

19  1.587e+02 

m  , 

’  f 

r(X)03 

18 

20  1.8S9e+02 

cOOOl 

18 

0  0.0035  lpf 

c()002 

22 

0  0.00052pf 

> 

c()003 

20 

0  0.00693pf 

*  ' 

c()0()4 

19 

0  0.00364pf 

*  NET4 

r()()04 

15 

16  9.600e-02 

r()005 

15 

17  9.600e-02 

r()006 

14 

15  5.857e+00 

(.0005 

14 

0  ().00042pf 

c0006 

17 

0  0.001 6  7pf 

c00()7 

16 

0  0.001 6  7pf 

. 

c0008 

15 

0  0.003  75  pf 

*  NET 3 

■ 

r0007 

11 

13  9.600e-02 

rOOOS 

11 

12  9.600e-02 

*  \ 

r()009 

10 

11  1  004e+01 

rOOK) 

10 

11  1.006e+01 

c0009 

10 

0  0.00265  pf 

COOK) 

12 

0  0.001 6  7pf 

S'. 

cOOll 

13 

0  0.00167pf 

cOOl  2 

1 1 

0  0.005  98  pf 

*  NET2 

rOOll 

8 

7  5.935e+(X) 

rO()12 

8 

9  7.002e+01 

rOOl  3 

6 

8  5.101e+00 

cOOl  3 

6 

0  0.00250pf 

> 

cOOl  4 

9 

0  0.(XJ297pf 

c()015 

7 

0  0.00250pf 

c0016 

8 

0  0.0079Spf 

c 

m  1 

7  18 

14  1  pdev  1=  3.(X)u  w=  9.<X)u 

+  ad=  94.50p 

as*  94.50p 

.V 

m2  10  19  6  0  ndev  1=  4.50u  w=  9.00u 


+  ad=  162.(X)p  as=  81.(X)p 

* 

ends  invl 

xl  12  10  13  1 1  9  8  invl 

v>, 

*Readin  time 

0.42 

'Geometric  extraction  time 

1.13 

'Total  job  time 

1.55 

y 

y 


In  the  output  file,  the  user-specified  node  names  are  listed  with  their  corresponding  integer 
node  numbers.  This  information  will  help  users  to  identify  iio  node  numbers  in  the  simula¬ 
tion  phase  Since  the  output  file  is  SPICL-input  compatible,  it  is  ready  for  simulation  except 
that  users  have  to  supply  the  input  waveforms. 
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